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1 croTACTcr tcagcctgat gtcaaaagca aaagttcaga agttcctcat 

51 CAATAAGGAG TCCTTGTGAG CAGGTGAAGC TCATCTAACT AGGCATTTCT 
101 ATGATGTGGC TGCTTTTAAC AACAACITGT TTGATCTGTG GAACTTTAAA 
151 TGCTGGTGGA TTCCTTGATT TGGAAAATGA AGTGAATCCT GAGGTGTGGA 
201 TGAATACTAG TGAAATCATC ATCTACAATG GCTACCCCAG TGAAGAGTAT 
251 GAAGTCACCA CTGAAGATGG GTATATACTC CTTGTCAACA GAATTCCTTA 
301 TGGGCGAACA CATGCTAGGA GCACAGGTCC CCGGCCAGTT GTGTATATGC 
351 AGCATGCCCT GTTTGCAGAC AATGCCTACT GGCTTGAGAA TTATGCCAAT 
401 GGAAGCCTTG GATTCCTTCT AGCAGATGCA GGTTATGATG TATGGATGGG 
451 AAACAGTCGG GGAAACACTT GGTCAAGAAG ACACAAAACA CTCTCAGAGA 
501 CAGATGAGAA ATTCTGGGCC TTTAGTTTTG ATGAAATGGC CAAATATGAT 
551 CTCCCAGGAG TAATAGACTT CATTGTAAAT AAAACTGGTC AGGAGAAATT 
601 GTATTTCATT GGACATTCAC TTGGCACTAC AATAGGGTTT GTAGCCTTTT 
651 CCACCATGCC TGAACTGGCA CAAAGAATCA AAATGAATTT T GCCTTGGG T 
701 CCTACGATCT CATTCAAATA TCCCACGGGC Al I I I IACCA GGTTTTTTCT 
751 ACTTCCAAAT TCCATAATCA AGGCTGTTTT TGGTACCAAA GGTTTCTTTT 
801 TAGAAGATAA GAAAACGAAG ATAGCTTCTA CCAAAATCTG CAACAATAAG 
851 ATACTCTGGT TGATATGTAG CGAATTTATG TCCTTATGGG CTGGATCCAA 
901 CAAGAAAAAT ATGAATCAGA GTCGAATGGA TGTGTATATG TCACATGCTC 
951 CCACTGGTTC ATCAGTACAC AACATTCTGC ATATAAAACA GCTTTACCAC 
1001 TCTGATGAAT TCAGAGCTTA TGACTGGGGA AATGACGCTG ATAATATGAA 
1051 ACATTACAAT CAGAGTCATC CCCCTATATA TGACCTGACT GCCATGAAAG 
1101 TGCCTACTGC TATTTGGGCT GGTGGACATG ATGTCCTCGG AACACCCCAG 
1151 GATGTGGCCA GGATACTCCC TCAAATCAAG AGTCTTTCAT TAGTGCTAAG 
1201 CCTATTGCCA GAATGGGAAC CCACCTTTGA TTTTGTCTGG GGCCTTGATG 
1251 CCCCTCAACG GATGTTCAGT GGAAATCATA ACCTTTAATG AAGGCATATT 
1301 TCCTAAATGC CAATGCATTT TACCTTTTTC AATTTAAAGG TTGGTTTCCA 
1351 AAGCCCTTAC 
(SEQ ID NO: 1) 



FEATURES : 

5'UTR: 1-100 

Start codon: 101 

Stop Codon: 1286 

3'UTR: 1289 



Homologous proteins: 
Top 10 BLAST Hits: 

CRA 1 18000004922653 /al ti d=gi 1 7434997 /def=pi r 1 1 G01416 1 ysosomal ... 431 e-120 

CRA 1 18000004903706 /al ti d=gi 1 542751 /def=pi r 1 1 S41408 1 ysosomal ... 430 e-119 

CRA | 18000004924799 /altid=gi 14557721 /def=ref |NP_000226. 1| lipa... 428 e-119 

CRA | 98000043616611 /altid=gi | 12844223 /def=dbj | BAB26283.il (akO. . . 415 e-115 

CRA 1 98000043617058 /al ti d=gi 1 12845127 /def=db j | BAB26629 . 1 1 (AKO . . . 415 e-115 

CRA 1 98000043616593 /al ti d=gi 1 12844194 /def=db j | BAB26272 . 1 1 (AKO ... 414 e-115 
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CRA 1 98000043617174 /altid=gi 112845372 /def=dbj |BAB26725.1| (AKO. 

CRAI98000043617140 /a"ltid=gi 112845298 /def=dbj | BAB26697.il (AKO. 

CRA | 98000043617224 /altid=gi 112845477 /def=dbj |BAB26766.1| (AKO. 

CRA| 98000043616955 /altid=gi 112844939 /def=dbj | BAB26556.il (AKO. 

EST : 

gi 18003062 /dataset=dbest /taxon=960. . . 
gi 18000757 /dataset=dbest /taxon=960. . . 

EXPRESSION INFORMATION FOR MODULATORY USE: 

gi 18003062 Stomach normal 
gi 18000757 Stomoach normal 

Tissue expression: 
Human leukocyte 
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1 MVWLLLTTTC LICGTLNAGG FLDLENEVNP EVWMNTSEII IYNGYPSEEY 
51 EVTTEDGYIL LVNRIPYGRT HARSTGPRPV VYMQHALFAD NAYWLENYAN 
101 GSLGFLLADA GYDVWMGNSR GNTWSRRHKT LSETDEKFWA FSFDEMAKYD 
151 LPGVIDFIVN KTGQEKLYFI GHSLGTTIGF VAFSTMPELA QRIKMNFALG 
201 PTTSFKYPTG IFTRFFLLPN SIIKAVFGTK GFFLEDKKTK IASTKICNNK 
251 ILWLICSEFM SLWAGSNKKN MNQSRNDVYM SHAPTGSSVH NILHIKQLYH 
301 SDEFRAYDWG NDADNMKHYN QSHPPIYDLT AMKVPTAIWA GGHDVLGTPQ 
351 DVARILPQIK SLSLVLSLLP EWEPTFDFVW GLDAPQRMFS GNHNL 
(SEQ ID NO: 2) 

FEATURES: 

Functional domains and key regions: 

[1] PDOC00001 PS00001 ASNLGLYCOSYLATTON 

N-glycosylation site 

Number of matches: 5 

1 35-38 NTSE 

2 100-103 NGSL 

3 160-163 NKTG 

4 272-275 NQSR 

5 320-323 NQSH 



[2] PDOC00005 PS00005 PKC_PHOSPHO_3rTE 
Protein kinase C phosphorylation site 

Number of matches: 4 

1 125-127 SRR 

2 204-206 SFK 

3 243-245 STK 

4 266-268 SNK 



[3] PDOC00006 PS00006 CK2_PHOSPHO_33TE 
casein kinase II phosphorylation site 

Number of matches: 8 

1 53-56 TTED 

2 130-133 TLSE 

3 132-135 SETD 

4 142-145 SFDE 

5 162-165 TGQE 

6 185-188 TMPE 

7 274-277 SRMD 

8 348-351 TPQD 
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[4] PDOC00007 PS00007 TYR_PHOSPHO_SrTE 
Tyrosine kinase phosphorylation site 

161-168 KTGQEKLY 



[5] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Number of matches: 4 

1 14-19 GTLNAG 

2 117-122 GNSRGN 

3 121-126 GNTWSR 

4 175-180 GTTIGF 



[6] PDOC00110 PS00120 LIPASE3ER 
Lipases, serine active site 

167-176 LYFIGHSLGT 

Membrane spanning structure and domains: 
Helix Begin End Score Certainity 
13 23 1.398 Certain 

2 167 187 1.637 Certain 

3 248 268 0.715 Putative 



BLAST Alignment to Top Hit: 

>CRA 1 18000004903706 /altid=gi 1 542751 /def=pi r 1 1 S41408 lysosomal acid 
lipase (EC 3.1.1.-) / sterol esterase (EC 3.1.1.13) 
precursor - human /org=human /taxon=9606 /dataset=nraa 
/length=399 
Length = 399 

Score = 430 bits (1094), Expect = e-119 

Identities = 211/394 (53%), Positives = 274/394 (68%), Gaps = 2/394 (0%) 
Query: 2 imllltttclicctlnaggfldlenew 61 

ML CL+ TL++ G V+PE MN SEII Y G+PSEEY V TEDGYIL 

Sbjct: 3 MRFLGLWCLVLWTLHS EGSGGKLTAVDPFJNMNVS EIISYWGFPS EEYLVETEDGYI LC 62 

Query: 62 VNRIPYGRTHARSTGPRP\A/YMQHALFADNAYV^ LGFLLADAGYDVWMGNSRG 121 

+NRIP+GR + GP+PW-HQH L AD++ W+ N AN SLGF+LADAG+DVWMGNSRG 
Sbjct: 63 UsIRIR^RKNHSDKGPKPWFLCP^ 122 
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Query: 122 NTWSRRlllCrLSETDEKRA/AFSFDEMAKY^ 181 

NTWSR+HKTLS + ++FWAFS+DEMAKYDLP I+FI+NKTGQE++Y++GHS GTTTGF+ 
Sbjct: 123 hrn^SRKHKTLSVSQDEFWAFSYDEMAIOf^ 182 

Query: 182 AFSTMPEUVQRIKMNFALGFTTSFK^^ 241 

AFS +PELA+RIKM FALGP S + T + LP+ +IK +FG K F + K 
Sbjct: 183 AF^IPEUyCRIKMFFALGPVASVAFCTSPMAKLGRLPD 242 

Query: 242 ASTiaCNNiaLV^ICSEFIvSLWAGSNKKhlMNQ 301 

T +C + IL +C L G N++N+N SR+DVY +H+P G+SV N+LH Q 
Sbjct: 243 LGTIWCTIHVILKELCGNLCFLLCGFNERN^ 302 

Query: 302 DEFRAYO/^NDADNMKHYNQSH PPIYDLTAMI<A/PTAIWAGGHDVLGTPQPVARI LPQIKS 361 

+F+A+DWG+ A N HYNQS+PP Y++ M VPTA+V^fGGHD L DV +L QI + 
sbjct: 303 QKFQAFLMjSSAKNYFHYI^ 362 

Query: 362 LSLVLSLLPEWEPTFDFVWGLDAPQRMFSGNHNL 395 

L S +PEWE DF+WGLDAP R+++ NL 
Sbjct: 363 LVFHES-IPEWE-HLDFIWGLDAPWRLYNKIINL 394 (SEQ ID NO: 4) 



Hrrmer search results (Pfam): 

Scores for sequence family classification (score includes all domains): 
Model Description Score E-value N 



PF00561 alpha/beta hydrolase fold 46.7 2.5e-13 2 

Parsed for domains: 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 



PF00561 1/2 112 195 .. 1 71 [. 38.8 6.7e-ll 
PFO0561 2/2 294 352 .. 139 196 .. 8.0 0.19 
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1 TTATGGCCTA AC L I I I I IA A CTTTGAGTTA TTTTCAAGAG AAAATTTGAA 
51 AAAGCAGCCT TTGAGGAGAA AGAAGCAATC CAACAAACAA AAAGATAACC 
101 ACACTGTAAT AGGAAATGTG TTTTGAATAG GACATTGGAA GAAAAATAAT 
151 AATC AI II I I ACAGGTAGAT CCCAAAGTCA AGGATCTATG TTCAACCATG 
201 TGTGTTCCAC CATCTTCACA ATTGAATGAG TAACCATCAT TAAGCAGTTA 
251 GCTTAGGCCG TAATATGATT CTTGGACTGA GATTTCAAAA A TACCAC AGG 
301 CCTTCTGAAA GGTTACCCCT TTCTAGCTCC ACTATCATCT AATTTTATTA 
351 AAAAAAAAAA AAAAGGAAAA ATTTGAGCTT CTAGAGAGTA GGGGCTACCA 
401 TTTTGTATCC CACAGGGCCA AGGAACAAGT TTTAATGTAT TCATTTAAAT 
451 TAATTTCAGT ATGAGTATTG AAATATATAA TAGAAATATT GTAACATTAT 
501 ATATTTTCTA TATACTTTTA TTATATAGAA AATATATATT ACAGAATATA 
551 TTATTAAATA TTGTAGAACA ATATATAATA CAGAAAAATA TATAATACTC 
601 AGTAATATAT TAAATACTTA TTAAAATAGC AAGCTTATAT AGGAAGAGTG 
651 ATGGAGCATT GTGAGAAAGT TTCAGCTTTA I I ICI I IGAC ATTACTTTGT 
701 TTCTGCACAA ACAAAAGAAT TACAGGAATT GTCCAGATTA TTCAAATAAC 
751 TCGAAGTTGA GGAGGGAATA TAAGTCAATG ATGTAGAAAC TCTTTTAAGA 
801 TTTGAGCTAG CCTACAATCT GTAAAGATCT GTGAAATTGA ACTATATTTG 
851 TGCTATTTCC ATATTAAGTC AAGGCAACAA ATCAATATTA ATAATAATAA 
901 CATAGCACTT CTAGAACTTT CTAAAGAGTC CAATAAAGTT TTGTTAGAAA 
951 GGATTGTTTT TGAAGTTAAA AACCATGAGA AATTCCAGGA AAATCCACAT 
1001 ACCTATGCCA TCATACTATC AATCAGGGCA AA ACATGC TT GAGTCTTTCA 
1051 TCAAGACTAA ATGATTAAGG AGTGGTACAT AACTTTTCCC TGTTCTGACT 
1101 AGCTGAACAC TTCCTTTTAC TCCACATTTG TTTAATTGGC ATGAAATTTC 
1151 CCACTCCACT AAAACAGATC TTAGGATTTG GACAACACAA AATATCATTT 
1201 GTTTTGAAAG GATTTGAGGA TAAATCCAAA CTAATAGAAC TGAAACTTCT 
1251 ATATTATGCT GGGTAGCAAC TTAGTTTTCC CTACCCTTCT TCATGCTGGG 
1301 AGATGAAAGA GATTCAGTTA CGGCTTAAGC TCCACAGGCA TACAAAGTGA 
1351 AGCAGAAAAC TGAGGCACGT GTGCCTCCAT TATCTGGTAT CTCATGTGGG 
1401 GCTTAGAGGT AAATTGTCGT TATTTGGCCT CCATTTCTGC CTTTAACCAC 
1451 TGGTGTAAAC AAAGGTTACT GTGCCAAAGT TGACAGCAAC CCAAATCCCT 
1501 TTGGCATGTG AATTAGTTTC CTCTGCCATA CTGCTAGTTC CAAATTCCTT 
1551 CTGGTTTCAG GATTTAGGAG TCAGGGTTGC CTCATCTTCT CAAATGAGTT 
1601 ACAGTCACGC ACATCCCTAC ACACTGCATG GTTGGCACTA GTTCCTTGAT 
1651 ATATGTTACT CCGTTTGATC CTCATGAAGG ATCAAATGGG GAAGGGAGAT 
1701 ACTATTGTCT CTGATTGTCC ATTAAGATCT TGAGTATGTT CTACTTCCCT 
1751 GTTTGACACA CTGGTTTGAA AATGTTGCTA AGTCTTCCCA ACAATGACAG 
1801 ATACTCAGTG GAAACATGAA GGATTCCGTC AAACTGGTTA TTTTGCATCA 
1851 TGTAGACCAC TATTTCCCAA CCTGCAAGTG CATCATGGCC TTT GGTGTG T 
1901 CAGGGACACG CCTTGGGTGT GTGTCTCAGT CTAAAGCTTC CTCCTTTTCA 
1951 CAAGCTTCCT GTTTCTCATC TCTCTAGCTT CTAACTGTCA CTGTAATCAT 
2001 CTCTTACTCT TCAGCCTGAT GTCAAAAGCA AAAGTTCAGA AGTTCCTCAT 
2051 CAATAAGGAG TCCTTGTGAG CAGGTGAAGC TCATCTAACT AGGTAAGATG 
2101 AAGATCTATC ATAACCAGGA GGCAGGTTGG AAGGTGCCAG TTGCACTGGC 
2151 AGTCAGGTGC AAGAGCTCTG CAGTGAGGCT GCCTGAGTGT CCATCCTAGA 
2201 TCTCTCACCT CTTGGCTCTG TGACCTTGAG CAGGTCTTAA ATCTCTCTAA 
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2251 GCCTTTGTTT TTTTAATTGA TAAAATGAGG ATAATAATAG TACCAAAATT 
2301 AGGGAGATTT TCAGAGCTTA AATAACATAC GTGAACTATT TAGAGTAATG 
2351 CCTGCCATAA GGGGACTCAG TAGCTTATTA TTAGTTTCAT ACAATTTGAA 
2401 AAGTTTCATA ATATTTGCAG ATATAAGATG ATCTTCAACC AGATAGCTAA 
2451 TGTATGCAAA GCTATTTAGC TTCAGAAGTA AACTCTGCAT TTCTAGAAGT 
2501 TAAATATTAC TTTGTTATAG TGAATTATCT GTAATATTTA TCTCTTGCTC 
2551 ACTTTTATAA GAAAAATAGT GAAAGCATTT ATTAAGAACT TACACTGCAC 
2601 TAAATGTTAT ATATGACTTA ATCCTCACTA TAACCCTATG AGATAGGTTA 
2651 CATTATTGTC CTAATTTTAC TAACAAGGAA ACCAAGAGAC AAAGCTACTA 
2701 AAACACTTGC CTGAGGTTAG ACATCTTCTT CTGTGGTGAG GCTGGATTTC 
2751 AAATTTAGAC CATTTGACTG TAGCACTTAT ATGATGAGCA TGCTGTTTAG 
2801 TGTTATAGTG TTGGTCTACC TTTGAATAGA CATACTTTTA AACCATGGCA 
2851 AGGAAGTGAG ACTGCACATT GAAATATGTA AAATTTGCCT TTGGGTGCCA 
2901 CGTGAGAAAT AGTCACATCA CTAGAAACTA ATCATAAGCT TTTGTGTTTG 
2951 GTTAAAGTTT TATTGATCCA TTTTTCTTGT TTACTTTGTG GGATACTGGG 
3001 CTTAACTAGG GGATACCTCC ACI I I I I ACT TGGCCATGGT ATGAAAACCT 
3051 GTCCTCTGAA TCTTTAGATA TTTTGGCAAA TTGTAGGCAA ACAAAGACTT 
3101 AAAGCAATTC AACCTTGATT AAAATAAGAC CAAAAATGCC TCCATACTTG 
3151 ATTAAATTTA TTTCATTTTA GGAACTGGAT TATAATCAAG ACAACTTCTA 
3201 CATGAAAAAA TAGATTAATA GTGCTCCAAG TTAGTTCACT GTATTTATTC 
3251 CI 1 1 I IA TAC ATTATCTGCC TTCGGTGTTA TTCAAGTTTT CATTAATCAT 
3301 TAATAATTTC ACTAATCATT TTATTTCATT AATCAACATT GATAGTTAAA 
3351 ATTAATCTGT GAATATTAAA TGTTTTATGC CAGGCATTTC TATGATGTGG 
3401 CTGCTTTTAA CAACAACTTG TTTGATCTGT GGAACTTTAA ATGCTGGTGG 
3451 ATTCCTTGAT TTGGAAAATG AAGTGAATCC TGAGGTGTGG ATGAATACTG 
3501 TAAGTCATGG AAAACTGTGA AGAACATCAA ATAAAGCAGG ACTAATGGAG 
3551 TATGAGGTTA CGAAAGGTCC TGTTGTAACA GAAAATCTCT GATAAAACAG 
3601 ATAAAATGTA GATGGI I 1 1 I AACCTCTGCA AGAGTCAAGC TAGTTAGATC 
3651 TTTGTCTGAA AAACAAATAC TGTCCGGTAA TGAAAACCAA ATTGTGCTAT 
3701 TGTGCTATCT ATCTATCTAT CTATCTATCT ATCTATCTAT CTATCTATCT 
3751 ATCTATCTAT TTATCTATCT ATCTATAGAT AGAACCTCCT CmTGAATT 
3801 TATGTTTTAA GAATATCAAG CTATTTGTTG ATATACATGA TTGCCTTCTA 
3851 TTGATCTATA GTTCTATTAC TTTTAAAGCA AGAGGGGTCT CAAAAGACAA 
3901 TTGACTTGAT AATATAGCTT TGTCAGAAAG AATGGGTCAA TGCTAAATTT 
3951 TCCCCCAACC CCCCAAAATA TTAGCCAATA GTAGATATTT TTTAAAATTC 
4001 TACTTATTTT GTATTAAGAC TTTATTTATT AATTTTACAG TTACCTGGTG 
4051 CTACAAATTT CAGATAATTC ACCCTAATAA GCACACAACA GATGGTTTGT 
4101 TTTGATTCCT TTTTATATCC TTTGGAGAAG TTCCACTAAC GACTGTATTT 
4151 TTACTGGGCA GAGTGAAATC ATCATCTACA ATGGCTACCC CAGTGAAGAG 
4201 TATGAAGTCA CCACTGAAGA TGGGTATATA CTCCTTGTCA ACAGAATTCC 
4251 TTATGGGCGA ACACATGCTA GGAGCACAGG TACAAGATAT GTCTCTCCTG 
4301 AAAAGGGGAC TGCATTGACC TCCTGCTTCT CAGGAGGAAT TTAATGCTAG 
4351 ATATGCATCA ACAGAGTTTA TCAAAATTGG TTTGAATTAT TGGATTAGTC 
4401 TTTAAATAGT TATCAGGGAG GCTCACTCTT TGCCTGATAA TTCTCTGAAG 
4451 ACAGACAGGA ACCTAAAAAT ACAAACAGCA AGACTGATCT TGCTAACTGC 
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4501 AACCAGAGGT ACTTGTTAGG GTGTAAACAG AAAGGCAGAG CCTGCATTTT 
4551 GTCACCTCAT TACTGATTTA TCATGTGGAA AATTGCTTTG TCCCAGGAAA 
4601 ATGGATCCTC TCATTGTCAG AAGGAGATTT TCTAGGTTGT ATGAAATTGA 
4651 CTCTGGGGCA CCCAAGAAGA ACCTCTCCTG CTCCCACTAA AATTAAGGGG 
4701 CCTCCCTCTG CAGGATAAAA AACAATCTAG TTAAATGACA ACGCATTTCT 
4751 GAAAAGTTTT CCAGGACTGA AAACCTTAAC ATCCACATAC ACTTTGATCT 
4801 AAGGGACAGA CGGTTCATAG AATGAAAGAG TATGGTGTCA ATAAGGCTTG 
4851 AATTCTAGAA TGAGGAGCCA GCCATGCCAT AGCAGGGGAA TGATACTCCT 
4901 TAAAAGGGAA AATTTAACTA CAAATCCTCT GAAGTAGAAA TGATAAGAAT 
4951 AACCAAAATA TCTGCAATGG TTCAATAGCA AATAATTTAT TGGCAGCTGC 
5001 TTACCGTGTT CATTTTGCAT CTTTTTTCCC ACCACACATA TTAAGGAGCA 
5051 GCTGAAGTCA TGTTTGACAT TCTCTCCCTC TTTTATCTCC AGTTTCAGAA 
5101 TGAAAAATGA GAGTGAGATA TGAGTAGTTT TACTAGTTAA AATATGAAAC 
5151 ACCCAGTTAA ATTTGAAGGT CAGATAAACA ACAAATAATT TTGTATAAGT 
5201 CTCATTTTAA GATAATACTA AAAAGTCATT ATTTATTCAC TATTATCACT 
5251 ATTTATAAAA TTTTGTAGAG CATCCTGGAT CTTTTTGCTT ACTTTTGTTT 
5301 TTATTTTTTG CTAAATCTGG CAATCCCAGG CACATGTGTG AAGGAGCTGT 
5351 GAAATATAAA AGGAGAAAAC TTTTATGGGA AAGATTTGGC TTAAGGAGAG 
5401 ATAATTTTGG AAAGATTTAG AATTAAAGAT CATTCATTAG ATGTAATGTT 
5451 CTAAATACTT TATATCAGTT AAACTTCTCA TCAACAATAT GAGATGGGTA 
5501 CCACTAATAG TCACCATTTC ACAAATGATG AAATTAAGGC ACAACCGGTT 
5551 ATGTTAAGAG GCCTAAAGTC CACAAATAGC AAGCTGACAG ACCAGAATTT 
5601 AAGCCCAGGC ATGCTGGCTC CAGAGCCTGT GCTCTTAGTC ATTAAATTAT 
5651 AGTGCCTTAC TTGACCTTCC ACCCTGGTTA CnTGGATCT CCCTGAATGC 
5701 TCTCTCTCCC TCAGAAATAC TGGAAGTTGG CAGAGGGACA CTGAGCTGAG 
5751 CATATTATTG TAG I I 1 1 IAA ATGCTCTCCA CTGGACAGAA GATGGGGGAT 
5801 TTGAATAGAA ATTTGGTGAG GAACTAATCA GTGTCCATTT ACACTCACCT 
5851 CCTCTTCCTC CCTGGAAGAG CTATAGGACT TGAGTAAGCA TGATAAATTT 
5901 CGTGTCTTTG TAAACCACAC CCAGGAAATT TGTATATACA AATACATAGA 
5951 GCACAGTAGT TATCAGGACA GACTTTGACA TAAAAAGAAC TGGGTTTGAG 
6001 TCCCTGCTCT GGCCTTCTTA TCTGGGTGGC CCTCTGGGAA AGTTACTTAA 
6051 CTACATAAAG I 1 1 IGI I ICC ATATCTACAA AATGAGGTTT CTCAAAATAG 
6101 CAGCTAGTTT ATAGAGTTGT TGCAAGAATT TAGTAAGCTA ATACATATAA 
6151 ATACGTCAAC ATAGCACCAG GTACAAAAAT ATGTGCTCAA GAAACTGAAG 
6201 TTACCTGATT ATAATGCTCT ATACTATTGA CAAGGGAAAA GTGAAAACAG 
6251 I I II IGI 1 1 1 ACCATGTGTG TATGTGTGTG TGTCTGTGAT GTTTCCGACA 
6301 TGCTCTATTT AACATAAATT ACTCTCACTC TTTCTCTCTC TCTCTTTCTC 
6351 TTTCTCCCTC TCTCATCTTA CCCTTTCCCC CACCAGGTCC CCGGCCAGTT 
6401 GTGTATATGC AGCATGCCCT GTTTGCAGAC AATGCCTACT GGCTTGAGAA 
6451 TTATGCCAAT GGAAGCCTTG GATTCCTTCT AGCAGATGCA GGTTATGATG 
6501 TATGGATGGG AAACAGTCGG GGAAACACTT GGTCAAGAAG ACACAAAACA 
6551 CTCTCAGAGA CAGATGAGAA ATTCTGGGCC TTTAGGTAAA TATTAGCTAA 
6601 GAAAACTCAA GGGGGAAATT GGAGGCAATT TTAAAAAAAT AACGTGGACG 
6651 CTATTAATGA TTATCTTTGA CGCTTGAAGT CATATAGCTC CTTGTAGTTT 
6701 CTGTTAAGAT CTCAAAGGAG GGTAACAGCA AGAAGCTCTG ATTTTTCACT 
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6751 GATTCTCCCA CAAGCAAAGT ATGGCATTTC AACAAGATCA I I I I lACATC 
6801 CAATTCTGTG AATTCTATGC ATTAAAAGTA TGTCCAAAGA GACAGCTCAG 
6851 GAAATTATCA TGACCAATGT GCACATTCAT TCAGCCAATG TTTACTGAGT 
6901 GGCTACTGTA TGCGCTGTTC TAGGCCCCGA ACATTCAAAC AGGGAACAGA 
6951 CAAACTCTGA CCTCACAAAG CTTATGTTCA TTTTAGTGAT AATTTTACAA 
7001 GTCATTGCTC CTGGATTGCC AATCAACTGT GTAAAGATGA TTTGGACCAG 
7051 GACCTTATTG ATTTAGAGAA ACTGTGATTG ATTTAGAGAA ACTGAGATCG 
7101 CACATAGTAC CATTTTCAGG AAAACTCCAA TATTAGATTT TTAAAACCTT 
7151 GTTAATGGGC AATGAAGAAG AATCTTTTTT GATATCTTGT TTCTTTTAAT 
7201 GGAAGAGTTT TCTGCTGTCA CCAGAGGACA GGCTGATGCC TGCGATAGAC 
7251 TTTTCTTTCT TCAGGCCTAA GCTCCCTGTT GGTTTGTAAA CCTGATGCTA 
7301 GAACAGACTG TGTATTCCTA TTACATTAAT AAAACATTCA GTACCCACTG 
7351 AAAGTTTGAG AATAGTGGAG GAATAGAATA GAATGTTATA GTCTGAGTTC 
7401 TTGGGCAGGG GCAAGCATCA GGAAATATTG AATCATTAGT CTTTAGGAGG 
7451 TGTCACAACA ATTCTCCTAT TCTTGTAAGT CCCAATCTAT AGATTTCCTC 
7501 ACATGTTCTT TTAATAAACA GGCTTCTAGC TTATGGAATA CCTGATTTGA 
7551 CTAAATGTTA TATAGGCCCT TTTGTTCCTC CTGTCTGAAG AACAAAATAC 
7601 TAGTACTATG GAATATTGGT ATATATTAAA TATATATCTA TATATCCATG 
7651 TGGACAGGAA TACTACTACT AACAACATCT TACTGAGCAC CCACTGGCAG 
7701 CCAGAGTCGT TTCTTTCATA CTATTAAACC CCGTTAGCAG CCCCGTAAAC 
7751 CAGGTACTAC CCTGTTTATT TCCCAAATGA GAAAACATAG GCTCAGAGCA 
7801 TTTCAGTAAT TTCTCAAGAG TTGCAAAGGC CATAAATAGT AGAATCATGA 
7851 TTTACAAAAC CCCTGTTTCC AAAGATGGGT ATTAAATGGT CCTAACAATT 
7901 GTGAAGCCTC ATGTGGGAGT CAGAAGTAGA GGCACACAAG CCAGATGGGG 
7951 AAAGGGAGGG CAAAGAAAAG CAAGAGAAGG GAAGGAAGAG GAGGGATCAT 
8001 AAGGTTGAAC TTCAAATATC ATACACAAGT TTCGAAAGTG TTCCTCTTAT 
8051 AAGGAAGTAA AATGTACATA TGCAGAAAAA CAAAAAGCTA CAATAGCCTA 
8101 CATATAATTG GATAAATAAT GAAATACACA TTGAATCTAA GTAAAC AGCA 
8151 TAGAATCTGG GTGTAAAAAA GAAGTGAGCA AGTGCTCTGA GT TTTAAA CT 
8201 TAAACTTGCA AGTATTTATA AAAGCCCCTG TTTTATTTTG CAGTTTTGAT 
8251 GAAATGGCCA AATATGATCT CCCAGGAGTA ATAGACTTCA TTGTAAATAA 
8301 AACTGGTCAG GAGAAATTGT ATTTCATTGG ACATTCACTT GGCACT ACAA 
8351 TAGGTATGTT TATGAGGGTC ACTGTTAGGT GTGTTTTTGA GGGTCAGTTT 
8401 TCTCAGAGTC TTACAGGAGT TCACCTTTAT GTTGGAATAA AACAACTGTT 
8451 ACTTATAGTG CCCTCAATTC CCTGTCCTCT GCTGGGAATA ACCCTAGTAC 
8501 TCTAAGTAGC TGTGAGCCTG CAGTGCACAG ACTATATGTA GGGCAAACCT 
8551 TTCCTGGGTC TCTGGTCACA GCAGCATATT GACTACGGTG ATGCAATTTC 
8601 CCAGGAATAA CATGTGTTCC AAATTCAAAG AAATAATTCC ACAGAGTAAG 
8651 TTTCTAGATT CCCTCTGAGC TGAAAAAGTA AAATTCAATG CCATGGAATA 
8701 TGGCTGAAAC ATAATAAATG TGCATCAATC ATCTCTTTCT CACAACCCAA 
8751 ATGGGATTTT TAAAAAATAA AAGGGAAGGG CTTATACCTA TATTTAAACA 
8801 AATTGAAAAG GCATGGTTAT ATTTGTTTGT GAGTTGGAAC ACACAAGCTT 
8851 ACTATAATAA ATCAATTGAG CTTATCTATT CAGTGTGTGA TTTA GTATTT 
8901 ATGAAATAGC AAGTAAATGT AAGCACTATG TAGAAATTTC TAAAGTTTTT 
8951 TAAGCTGACA ACTTACTTCT TAATTTACTT ACTTTACTTA ATTTACTTTA 
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9001 CAATTTACTT TCCAGGTATT TTGGAAAGAA ATCAATAATC TAGTTCCAAG 
9051 TAAAAGTTGA AAGGAACCCA CACTAATAAA AGCTTTGAAT TTGTCATTGA 
9101 ACTTCCACTA AAGTTTCCAA TTTTAAGAGA ATAAATCATG TGAAAGTGCA 
9151 ATATTTCAGT TTAGGGAAAT ATTTTCATTA TCACCACTAT CATCAGTAAC 
9201 AAACATATAT TCATTAGTAT TTTAGATTGA CAGGCACTTT CCAAGCTCAG 
9251 AACAGGCAGT TAGCATCAGT CAGCATATAC TAAAAAAGTA TCAAAGAACT 
9301 CATAGGAGAT CAAAAATGCC ACCAATAGGC AAATAATTAC AGTATCTAAC 
9351 ACTTATTGAG CATTCGTTAT GTGTAGGGTC TTGTGTTCAG GACCTTCCCC 
9401 ACAGTATCTC CCTCTGATCT TCAAAACAAC CCGAATGTTA TTATCCCCAT 
9451 CTCATAGAAG AAGAAACACA AGTTCAGAAC ACAGATTCAA ACCAGATGTA 
9501 TCTGATTTCA CCAATAGGGT GTGTAAGGAT TCCGGAGAAA TGGTGTAGAG 
9551 AAGAAGAAAT GACTTTAGTT GGTTTTGGAA AGTGGGTAGG ACTTAGATAT 
9601 GCTCTTATAC TTGATCTGCA AAAAAAAAAA AAAAAACCAT GGAGAATTTG 
9651 ATTATCTGTG CTCTGTGTTT CATTTAGGAC ATAAATATTT TTAGTGACTG 
9701 TTGTTTGCAT TTTGGACAGA GCAATTTCTG TTATGTAAGG AGCACCCACT 
9751 CTTTGTAGGA CATTTAGTAG GTCCCAGCCC ATTAAACAGG GCTCTGCAGT 
9801 CAGCGTGACC CTCAAAAATC TCACCTCCAC ACATTTCCAA ACACCCTCTG 
9851 GGGAAGTACT ATTCCTGATT CAGAGTCTTT T TATCAA TTG TTCAGTCAAT 
9901 TATTTCAGTT CTTCTTTTTC TGGCCAAGAC AGTTTTAATG TTCCAACAAG 
9951 TGTTTCAGTA CACACATACA CACACACACA CACACACACA CACACACACA 
10001 CACATGCTAG TGGAGGCCCA GGAAGGGACC TCTGGAAACC AAATTATATG 
10051 GATATTCTCC CTAGCCTACC CAGTGTTGTG CTAATCTCCA TCCTCACAGA 
10101 TATACAAAGG GGTGCAATGC TACTGCTGAA AGAGCAAAGC AAATGGAGAT 
10151 GCCTGGTCCT TACTGGGCCA TCGTGGATGC TAGGGAAAGC CCCTTTCTTT 
10201 TTGGAAACAG GGAAGAGTCT AGAGGGTTGA AAAACACCCA GTAAGACACT 
10251 GGGAGCAGTG AAATTTCATT CCATAGTGAG AAAGAAAACC TGTTAGAATA 
10301 ACTGGGTGAT GCTGCAGAAA GAAATCAATT CACCTCCTGT GACTGATTAT 
10351 TTGCTTCTGG AAGCTCTGTG ATTCATTCTG GCATCTCAGA GTTAGGGATG 
10401 AAATGAGAAT GTTGCCAGCA TTTACCCCAT GCTTGGGAAG TTTACACAGC 
10451 AGTAGCTACT CCAGCAGCTT AACCATCACC TTTCCCCTGC CAACTACTCC 
10501 ATTTCCCCCA ATCAAGTCAA ACTGTCCATA AATAGAATAA AATAAAATTG 
10551 GAGACTTGAG AGCAGAGAAG ACTGAAGGCA GATTATCTTT ATAGAATAAC 
10601 TCAGAAGACT TCCAATTCAT CCCCAGTATG ATCACGATAG AAGGAAAAAA 
10651 TGACTAAGCA GAGCCCCAAT TTTGTTAGAA ACATTGCGTA AGT ATTTAT T 
10701 TTTACAAGAT TGTCTTATCT CCTGTTCTCT CAGGGTTTGT AGCCTTTTCC 
10751 ACCATGCCTG AACTGGCACA AAGAATCAAA ATGAATTTTG CCTTGGG TCC 
10801 TACGATCTCA TTCAAATATC CCACGGGCAT TTTTACCAGG TTTTTTCTAC 
10851 TTCCAAATTC CATAATCAAG GTAGGCTCCT TTCAACAAAA TGTACCTGAG 
10901 GATCTCATTT TGGATCATAA ATCCTTATTA TTTTCAAATC TACTGTAAAG 
10951 TAAAAGTAGG AAATTTAGAT AAAATCTATA GAACTTAGAC TCTGTGGGTA 
11001 TGTGCTTGTG TATGTGTGTC CCTGCGTGTG CGCATGTCTG TGCCATAGTA 
11051 TCTGCAGGTT CTGTAATACA ATTTACTATA CAAGGTCATC AGCAGGCTGA 
11101 GTATATGTCA GAATTTCTAG CTGAACTGAG TGCTATATGA CAACAAGGAT 
111^1 nTTG IGI I TTCCCAAGTG TTTTTTGTTC CATTTAGTCA GGTAGGTCAA 
11201 TGAATTCACA TTGCCCAAAT GAAAGACACT TCAAGTTACC CATAATCACT 
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11251 GATGTGTCCA ATTTTGACAT TAGAAAAACC TGATTAATAT ATTCOTCCA 
11301 ATATQGAAAC TTGCCCTAAT AACTAAAGCT AAGATTCCAA AGCCTAAATG 
11351 TATTACAGCT CAAGTATTAA TTCAAATATT TATTGGTTAT TTTTCAGGAG 
11401 TTGAAAAAGT CATTTGGTTG CCAATTGTGG ATTTGGGATT TTATCTATTA 
11451 AAGG GI I I I I I 1 1 1 1 I I I IC TCTTTGCTTT TGTTTCTCTA CAAAGGTCAT 
11501 TGCCACAATG AACACAGCAT TTAATCAAAT TCCAGATTGG CCTTTGAACT 
11551 TGGGATGATG GATAAAATGG ATTTGGGCCA AAATTGAAGT CAAGGAGACC 
11601 AGTTAGAATA TCAAAATAAT TCATATATAA GAAAATGAGA CGTTGGTTTG 
11651 GGGTAGAGTG GTAGGAATGA AAAAAATTAT TTGTGAGCTA ACACAAGGAA 
11701 TAATTTCCAT AGGGCCTAAT AATAGTTAGG TCTGATAATA CTATGGTCTG 
11751 ATAATAGTTT TATTGTATTG TTTACTGAGA GCACAAATGA TGTAACTTCC 
11801 TTATTCAAGA GCTTTTCTAG TTTATTTAAA AATGTGTTGA CATCAGTTAG 
11851 GTTTTAATGT TTTCTATATT TGGACAGTGT GAGCAAACTA ATTTGTTAAA 
11901 TTAAATTCAG AGAGAGATAC ATCTATCTGT AAATACATAT ATGCGTTGTT 
11951 TGTGTTGCTC TTCCTACATA GGTCAGCTAT AAGGCAAATA ATGTTCCTGG 
12001 GTTATCTCAG TTTCACATTT CCCACTGTCA ATATTCCTGC TACTTTTAAG 
12051 TCCCATATCC TGCTCTTTTC TTCCGTCAGT TTCCCCCAGA AGCTCCAAGA 
12101 CCCCACCAGG AATCCCCATC CAAGTTTACT TTCCCAACTC CTGGAAGTTT 
12151 CAATTGTGCT GCCTTTGTGA CATTATCATA TCTTTTCTGT TCAATGGTTG 
12201 CTTCTCTTTG GCTCACTGTT CTCTACTTTT CAGCCTGAGA GCTGGCTAAT 
12251 CTGGGACAGT ACTCGAATGC AGTGTACACA TGGGTAACAT GGAAA ACCCC 
12301 GATTTTCCCT TATATTCAAG GTATTATTTG ACCTTAAGAA AAACTGTTTT 
12351 ACATTTCATA CCAATTAATG AGAAAAAAAT ATTGGCAAGC ACTGACTGGG 
12401 CAGAATACAG GGAAGCTTCA CTATGGAGAA GTGAATTTGG GATTGAGGGC 
12451 CTTTATTGCA ATCTCCTTGT AAATAATATT TGATACTCTT CCTCATCTGG 
12501 AGACACATTC CTAAGTAACT TTTCCTGAAT AATTTGGTCT CCTTGACTGA 
12551 ATCAGTAAGT ACAAATAGAT CCCCAAGCAT GGCTCTTTCC TAGAAT GAAA 
12601 GAAATGTCAA GAAGTCTGAA GATGATTCTT GAATTTTGGT TTTTTGCTAT 
12651 TGCTATTTGG GCTTGTTGTC CTTGTTGTTG CTATTGAGTT GAGCT CCTTA 
12701 TATATTCTGG TTACTAATCC CTTGTAATAT GGATAGTCTG CAAATA TTTT 
12751 ATCTCATTCA AAGATAATTA TTATTTACTT TCATAGGCTG TTTTTGGTAC 
12801 CAAAGGTTTC 1 1 I I IAGAAG ATAAGAAAAC GAAGATAGCT TCTACCAAAA 
12851 TCTGCAACAA TAAGATACTC TGGTTGATAT GTAGCGAATT TATGTCCTTA 
12901 TGGGCTGGAT CCAACAAGAA AAATATGAAT CAGGTATGTA TGAT AATTAT 
12951 AGGGCCATTT GATACCTTAA GAAATTCCAG CTTTCCTTTG ACTCATTTTG 
13001 ATATATCTAT TTACTGTATA AATTCATATG GTATTCCAAA CCCTTAAAGA 
13051 CAG AI I I 1 1 I TTTGCTTTTA AAAATGTTTA TGGGTATATA ATAGTTGTAC 
13101 ATATTTATGA GACACATATA TTTTGATATA AGCATACAAT GTGTAATGAC 
13151 CAAATCAGGG TAATTGGGAT ATCCATCACC TCAAGCATTT ATCATTTCTT 
13201 TTTGTTAGAG ACATTCTAAT TTGACTCTTC TAGTTATTTT GAAATATACA 
13251 ATGAATTATT GTTAACTATA GTCATCCTAT TGTGCATGCC AGACTTTAGT 
13301 CCTTCTAACG GTATTTTGGT ACCCATTAAC CAATGCCTCT TTATCCTTCC 
13351 CCCACCCCTA CTACCTTTCC CAGCCTCTGG TAACCATCAT TCTTCTCACT 
13401 ATCTCTATAA GGTCAGTTTT I I I I IAAACT CCCCTATATG AGTGAGAACA 
13451 TGCAGTATTT GTCTTTTTGT GCCTGGCTTA TTTCACTTAA TGTAATGTTC 
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13501 TCTAATTTCA TCCACATTAT TGCAAATGAC ATGATTTCAT TCTTOTATG 
13551 GCTGTCTATA TGTACCACAT TTTATTTATC CACTCATCTG TTGATGGACA 
13601 CTTAGGCTGA TTTCATATCT TGGTCATTGT GAATAGTGCT GTACTAA ACA 
13651 TGGGGGTGCA GATGTCTCTT CCATGGATTG ATTTCCTTTT I I I I I iCTGA 
13701 ATATAGACCT AGCACTGGAA TTGCTGGATC ATATGGTAAT TCTACTTTTA 
13751 Gl 1 1 1 1 1 G AG GATCCCTCAT ACTCTTCCCC ATAGTTCCTG TACTAATTTA 
13801 CATTCCTACC AACAGTCTGT GCAAGAGTTC TCTTTTCTCC ACATTCTTGT 
13851 CAGCATCCAT TATTGCCTAT CI I I I IGATA AAAGCTATTT TAACTGGAGT 
13901 GAGATAGTAC TTCATTGTAG TTTTAGTTCG CATTTCTCTA ATGATTAGTA 
13951 ATGTTGAACA TTGTTTTTAA TGTACCTGT GGCTATTTGT ATGTCTTCTT 
14001 TTGAGAAATG TCTACTCAGA TCTTTTGTCC ATTTTTAAAT CAGAl I 1 1 I I 
14051 TTTTGCAATT GAGTTATATG ACCTCTTTAT ATATTCTGGT TACTAATCCC 
14101 TTGTCAGATG GGTAGTTTAC AAATATTTTC TCTCATTCAA CAGGTTCTTT 
14151 AGTTCACTTT GTTGATGGTC TCCTTTGOT TGCAGAAGCT TTTTAGCTTG 
14201 ACGTAATCTA Al I I GI I CAT GTTTGCTTTG GTTGCCTGTG CATTTGAGGG 
14251 CTTACCTCAA ATTGGCCCAG ACCAATGTCC CGGAGTGCTT CTGTAATGTT 
14301 T GI I 1 1 I I A G TAGTTTCATA GTTTTAGGTC TTAAATGTGT CTTTAATCCA 
14351 TTTTGATTTT GTTTTTGTAT CTGGCAAGAG ATAGAGATCT AATTTCATTC 
14401 TTCTGCATAT GGATATCTAG TTTTCCCAGC ATCAI I ICI I GTGGAAATTG 
14451 TCCTTTGCCC AATGTATGTT CTTGATGCCT TTGTTGAAAA TTAGTTGACT 
14501 ATAAATGTGT GGATTTATTT GTGGGI ICI I TATTCTGTTC CATTGGTCTA 
14551 TGTGTCTGTT TTTATGCCAG TATCATGCAG TTTTGATTAT TACA GGTTTG 
14601 TAGTATAATT TGAAGTCAGG TCATGTGATG CCTCCAGCTT TGI ICI 1 1 I I 
14651 TCTCAGAATC TTATATTTAG AAAAACGTAA AGACTCCAAC AAAAAACCTG 
14701 CTAGAACTGA TAAACAAATT CATTAAATTT GCAGGATACA ACATCAACAT 
14751 ACAAAATTCA GCAGCATTTC AATATGCCAA GAGCAAATAA TCTTAAAAAA 
14801 AAGAAAGAAA AAAAAACAAG AAATAATCCC ATTTATAATA GCTACAAATA 
14851 AAATAAAACA CCTAGGAATA AACCATACCA AAGAAGTGAA AGATTTCTAC 
14901 AATGAAAACT ATAAAACACT GATGAAAGAA ATTGAAAATG ACATTAAAAA 
14951 ATGGAAAGGT ATTCCATGTT CATGGATTGC AAGAATCAAT ATTGTTAAAA 
15001 TGTCCATATG ATCCAAAACA ATCTACAGAT TCAATGCAAT CCCTATCAAA 
15051 ATACCAATGA CATTCTTCAT TGAAATAAAA AAAAAGCCTA AAATTTAAGT 
15101 GGAACCATGA AGGTAGATGT CTGCTATACA TAGAAGATTA AGTACTCAAC 
15151 AAACCTTGAA TATGAAGACT GGGGAAGTGA ATAGGCAGCT TCACTCTTCT 
15201 ATTCCCTGGT GAAATTTAGG AGAATGGATG TTTTATAATG GGTAGCAGTT 
15251 TCTTACATGT TCTCAATCAG CCATAACTTA CTACAGTCAA TTTGAATTTA 
15301 TTGCATTTGA ATATATTGGA TTAAAAATAA AATCCTAAAA AAGGAGAGAA 
15351 GCACATATAA ACCTGCGTCT TATTTCATGT GTTCCTTTCT TTGTGGGTGA 
15401 CI I I IGI I I I GAAATAAAAC CTGCAAAATA ACAGGACAGG GTGGAAGGGA 
15451 GATGGGATCC CCTCTTTATG AAGAAGCAGC AGTCCTGTTT TATCACCTCT 
15501 TCATTTTCTG TTATTGAGAA TTCAAGAAGA AGGAGGAGGA AGAGTTCACA 
15551 TCCACAGACT GGTGTGGTTG AATAGTTGTC TCTACTGTAT TCCAAATAGC 
15601 AGCCAATGAG GCTGTTACAG TGAAGCCAGT CCCAAGATAA TTGTTCTGTA 
15651 CCCCTATTCT CTAAGAAGCT AAATTGTGTT AGACTGAAAC CCATAAGGAA 
15701 CCATTGTTCA AAGTTGGCTT GTTCAAAAGT AAAGAI I I I I AATAGTTTCT 
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15751 CTTAATTAGA TTATTTTCTA AGACATAGAA TTATGATTAC TATTTTATCT 
15801 CTATAATTTT CATCTCTATA ACGTTTACAA ATACTGAAAT AACCT7TGGA 
15851 AAAAATTGGC TTTTAGCTTT ACTTTTGCAA TATTTTATTT TATCCCCATA 
15901 AAAGCCTAGG AAATTGGTAC TATGACTTTT AGTATGTTCA TTTAATAGAT 
15951 GAAAACACAG AAACTCAAAG ATGTTAAATA TGGTGGCCAA GTTCACAAAG 
16001 CTGATCATTA ACAACAACAG GGCCTGAACT CCTGGTTTTC TGATTTAATC 
16051 TGTGACAGTG CACCTGGGTG CGCATGCATG CATCACCCCC ACACTTGCAC 
16101 ATAGAACCTT TCCTAGTTGG CTTTGCTCCA TGATGACCAT TACTGTTCCT 
16151 TCTACTTCAA AATAAGCAAA TTATCCTACA GATTCAGAGC TGGTACAGGT 
16201 GTGCTGTCAA GCAGCCCATT CCATTAGTCA GCTTGTGGTT CACTCACATT 
16251 AAAGTATTGA CCTAAATGGT ATATTTATCT AGATAATTCT ACCTTGTTAT 
16301 TTTCAAAGCC CCAGTCTTGT TTGCTAATTC TGTGCATCAT TTTTCTCTGA 
16351 TTCTGAAAGG CAAAATTTTG TTGGGCAATT GCTGTAATAT GAGTTTTATC 
16401 TCCTTTAGAG TCGAATGGAT GTGTATATGT CACATGCTCC CACTGGTTCA 
16451 TCAGTACACA ACATTCTGCA TATAAAACAG GTAGAGTCTT AGTCATGGAA 
16501 AACCATTCCA ATCCTTATTT TCAATATATT TAAAAAGACA GAATTGACCC 
16551 TGTTAACAGG CCTACCCTAA GAATCTTAAG AGCTTGCTTC CAGTTTGTCC 
16601 TTGCTGCCTT CTGTATGCCT TGATTTCCCT GGAATTTAAG AGAAAGGATG 
16651 TTATGGTACA GACCAAGTAG ATGACATAAA TGAACACCAC CTTAAATCAG 
16701 AGTTTTAAAA ATAGGCCCTG AACTGAAGCA AGAGGTAAAC TAGGGAAGCC 
16751 TCAGGAGAAC TGAGACTTCT CCAGAGAGAA GTATCTGGGA TTTAACTTCT 
16801 TTCTAATGAG GCTTGGTTTT CCATGAACTT TTCCTTTAAA CCAAGGGGGG 
16851 TATTGCTCAT CTTTCTGTTG AGCCCCATTT GTCATAATTG TAAAATGGGT 
16901 GGTTACATCC TTCTGGTGAT CTAGGAGCCC TATTTTCGTC CTAGCATACA 
16951 GCATTTTTCT AAAATTTGCT GTTAGCTTTC ATGATTCTTA CCCTAACTAT 
17001 TCTTTTTCTA AAAAACATTT GTTTCAGCTT TACCACTCTG ATGAATTCAG 
17051 AGCTTATGAC TGGGGAAATG ACGCTGATAA TATGAAACAT TACAATCAGG 
17101 TGAGCTATTT ACAGTAACCC CAGCATGCTG ATTTTGATAA ATTATAATAA 
17151 AAAATTATTT GAGGGTGGAA AGACTCCTAC CTGTCATTTG GTGGCATTTA 
17201 TACTGATAGA ACI I I I I I I I AAAAAAATTT TAATTTTAAT TTTAATTTAT 
17251 TTCAGAAAAT TTATAAATTA AAGAAGCATA TACAAAGAAA CTTACATCAT 
17301 GTGTAATCCT TCCATCCAGA GATAACTAGA TGTACTAACA TTTTGGTGTA 
17351 TTTATTCCAA TTTTCTCAGT ATTATATTGC TTTTAGACAA CTTTTAATCT 
17401 TTCTATTTTA CTTAAGCTAT AGTAAGAGAT AACTAATATA ACTGAGGGAT 
17451 TTTTAAATGC Al I I I IAATG GCTACATAAT AGAAATTATT TCATAAAAAT 
17501 CTTTACAGCA TAAATGAATA TACACTTTTT AATACCAACA GAAAAATTAG 
17551 AATTCCATAT GAAAGTTGAA TAAGTATTAC CCAACATTGA AGACTTGGGT 
17601 CGTAAGGCAT CTTTCTCCAT ATAGCTTTAT GACATAAAAA TCTGTAGCCT 
17651 TGTTTAGCAC CGTACTTTTA ATTAATCCTG TCACCATTTT TCTGTTCTCA 
17701 TAGCCAGGGG CTTGGCTTAT AAGTATGAAC TAAGCAAACT AAATTAAATT 
17751 GTTTTAAGTA TTTTCCCAGG CTATCATATT TTAAGCTATT TACTGGTGCA 
17801 ACTATAGATT ATTAATAAGT TGTTTCTGAG GATCAAAACA ATCAGACTAA 
17851 TCAATTTCTC AATAATGAAT TGGCCTGTTA GAGGAATAAT T CTACTAA TC 
17901 CTTAAAACCA CTACAAGAGA TAGACCATGT ATATTTTATT TAl I I I l AAA 
17951 AATAAGTTTA AGATGTGATT TACATACAAG AACATTACTA ATTTTGTGTG 
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18001 TCCCATTTAA TAAGTTTTGA CAAATATATT TATTTGTGTA ACCACACCAC 
18051 AATCTAAATA TAGGACGTTT ATATCACCAC TAAAAGTTTT TTTCCTGCTC 
18101 CTGAGACTAT TTATAGACAC AAATGCGTGT ATTTGCAAAT GCTTAGAAAA 
18151 GGTCTAGAAA AAAAAACAGT AAATGTTAAA GTGGTTATCT TCAGAGAGAA 
18201 GAAAGAAGAA AAGAAGTGGA TGGACATGAA ACAGTAAAGG ACCCTCATTT 
18251 TGGACTTTAC ATATGTCTGT I I IU ICCAT TATTTTGAAT AAACATGCTA 
18301 TATTTATAAA TTATTTACAT TTACAAGAAA ATGAAACAAA ATCAACACGC 
18351 ACATTCAAGA TCATTATGGT CAAGTACTAA AGTATGTGAG AGTGTTAATG 
18401 TCCTTAGAAT TTGGCCACAG TTAGCTGGTC CT ACTCTGC T CCAAGCCGGT 
18451 CCTATTTTGT GAATTAATCT CATTTGATGC CAATTTTTAT TACATTCTCT 
18501 CCAAAAAACr AGTCTCAACA GTTTGCTCTC TCCTCAAGTT CACAGCATTA 
18551 TCTCTGCTAT ATCTATATTT TATTGAGTAT AAGAGAATTA AC CCATGT AA 
18601 GCTCCATGAG GGTAGGGATT TCTCATCGTT TTGTTCACCA GTGTTTTCTC 
18651 ATCTTGAAGA GTACATGACA ATTACTGGGC TCCCAGTATC TATGTGTTGC 
18701 ATTAATGAAA TTTCTTAACT TTAATCTACC TCAAAATGTC TCTATCTTCT 
18751 TGATTCTCTC CTTCCTTTCT CTATCAGAAA ATGATGGTCC TCTTATTTTC 
18801 CAAGTTATTC CGGTCCTGTG CCCTTGATCC CATCTCTTCT CACTTCCCCT 
18851 TCCTTCCTGC CTCCATTCTC CTGTCCCTTA TGAAAAACAA GCAAGACCAT 
18901 CAATTCTATC AAGTTATCAT TATGTCACTC TGTTCTTATC AACATATTTT 
18951 TAGTATTGAA GAGGGCTTCT TCTACTTACT CCTGAACCTT GTACAATGTA 
19001 GTTTAGGTCT TCATLI I I I I ATCATAGCTA CCTTATTTAA AGTCACCCAT 
19051 GGCTTTTAAT TGCCAAATTC AATGGCCTAT CTTCACCTTT TGAAATGTGT 
19101 TATGTTCGTT ACCACAGTCT CCTTGAAACT CAGTCCCCTG ACTTG GACTT 
19151 CCATAACACA ATGATTTCTG ATTTTCCTTC TGTTTGTGAT TGTTCCTTTT 
15201 GTCCCAGGCA CTGGCTACTC CACCTTCCAC CTCTCTGAAA TCATTAGCAT 
19251 TCCCCAAGGA TTCTTCAAAA CTCTCTTTCT TCCTTGGAGA AGTCAGCATA 
19301 GCTTTAATTT GGACCATTTC TATGGCTTAT CTAGATTTTT TCAGGACTTG 
19351 CCTTCAACCT ATTCTTTCTG TAGGTGATTC CATTAACTGT TGCCCATATG 
19401 GTAGTCCGAA GACAGACCTC CGAGAAATGA CCCTTGTCTC CAAAACTTCC 
19451 GCAATATGTC CAAATTTCCT AGCCTGACAT TCAGACTTTG ATTATCTGCC 
19501 TCCAAGTTTA TATCCTATCA TATTCCTTTA TATATTCTGT TCTCCAGGTA 
19551 CACTGGGAAG CTTGCCATTC CTGATCATAG CCTACAAACT CTTCCTGCCT 
19601 CCCACTCACC CTCATCTCTG CTGTCAAAAT GCAACCTTCC CTCA AGAGTC 
19651 ATTTCACAGG ACCCCTCTTT CTATGAAGCC CTCAGGTGGA AATAATTTTT 
19701 TGC LI I 1 1 I I TCCATTTTAT TTTTGGAGTG TTTATGGCAT TTAACATACC 
19751 TTACTTTGTA TACAAATATT TGCCTTGCTC CCTCTTTTGC AAATTTCTTA 
19801 AAGGTAGAGA CCATTGTATG I I I IU I CAT ATGTTGCTGG TGCCTAACAG 
19851 AACTATGGCC ATTGTCCACA TTCATTTAGC AGCCTTTGTA GTTATTGCTT 
19901 TGAGGAGCTT CCTCTCATGA ATGCCCTTGC TTTCTCTCCC ACAGAGTCAT 
19951 CCCCCTATAT ATGACCTGAC TGCCATGAAA GTGCCTACTG CTATTTGGGC 
20001 TGGTGGACAT GATGTCCTCG TAACACCCCA GGATGTGGCC AGGATACTCC 
20051 CTCAAATCAA GAGTCTTCAT TACTTTAAGC TATTGCCAGA TTGGAACCAC 
20101 TTTGATTTTG TCTGGGGCCT CGATGCCCCT CAACGGATGT AC AGTGAA AT 
20151 CATAGCTTTA ATGAAGGCAT ATTCCTAAAT GCAATGCATT TACTTTTCAA 
20201 TTAAAAGTTG CTTCCAAGCC CATAAGGGAC TTTAGAAAAA ATGGTAACCA 
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20251 ACAATGAGGT TGTCCCCCAG CACCCTGGGG GAGATGCACA GTGGAGTCTG 
20301 TTTTCCAAGT CAATTGTGTT AGTGTTATTT ATGTTTAGAG ACATCTTTGC 
20351 ATGGGACCAT CTACAGGTCC TTATAAACAA TGAGGTAGAT TAGGCAAAAA 
20401 GATAAACAAG TTGCTACTCT ATCTGGCATT TAAGTCTAAT TAAATTGTAA 
20451 TTTTTAGGGC ATACCATGAA GTATAGAAAT GTCTGAAGCT TCAAAGGAAC 
20501 AGTGAAATTC CTTTAAGGTC CTATATGGAA ACCTCTGTTG TCATTTTATT 
20551 TATATGGATT GCTATGGCAA TGGACAGAGT GTGGGATTAG GAGGAGGGCC 
20601 TGTAACTTCT TTATAAAAGT TTCTTAGCTA TCCTGAAGAT GTATAGACAT 
20651 TTTTACTTTT TTAGGTATTT TCAACATCAG AAATTCAAAA AAGTCCCCAA 
20701 AGATTCTTCC AGAGAAGCCC TCTTTTCTTA CAATCTTATC CCTGGCTATC 
20751 TGCGTAAACG GAATCTTGAA CCCATAATAG GATACATGTA TAAAATCTTC 
20801 CTTATTAAAG CAGAAATAAA TTGTACAGCA TCAATATCAT TTT ATAATCA 
20851 TAGGGAGGCT TCTTTGTTTA GCATGTAATG CCCCCTTTAC AGGCTTTTTG 
20901 TTCTTTGAGG GGTTTGAACA TTCCATGAAA AACTGACAGA TAGGAAACTG 
20951 ACAATAAAAG ATTGAGCTAA AGATGGAAGC AGAAAGTACT AGGCTAGATA 
21001 GTCTCTAAAC ATTAAGTATT TTCTTCCTCC ATCTTAAAAG CAATGA GAAG 
21051 CCACCAAAAT ATTTTACCTA ATGGAAACCT GATTGCCGCA TTTTTGTAAC 
21101 CACCACTTTG GCTGCTACAT AGAGAATGGA TTAGAAGATG CCAACAAAAG 
21151 ATTCTGAGCA AGTCTGTAAA TCTGATCAAG TGTTCTGATG CAGGCTGATA 
21201 TCCTTCTGTG CTAAGAGAGA TGATCCTTGG AAAATCCAGA GCCAGCTCCA 
21251 TAATACTTTC CTGCTCTGCT GGCAAATCCA CAAGCTGCTG GCCCCTGGAG 
21301 CCATTCTTCT CTCAAAACTA GCATTCATCA ATTTAATGTA TACGTATTGA 
21351 TGGGGAATAA TGGTCACTAT GAAAACCATG TGATAATATG GAAAAATACC 
21401 CATGATATAA TGTTATGTGA AGAGAAGAAA ATGAAACTGG TAGAACTATG 
21451 TGATTGCAAA TATATACAAA TATTAAAACA ATTATATGAC TTTATAAAAT 
21501 ATTTGTATAT AATGAAAACT GAAGCAATAT AAAAAATAAA ATTAGTTGTG 
21551 TCAGGGTAGT AACATGATGA GTGATTAATA GTTTTTAATT TTTAATATAG 
21601 TAATGACATA ATGTTACAAC TTGTCCAAAT CTCACAAACA TAATATTCAG 
21651 TAAAGGAAGA TAAACATAAA AGAATACATA TTTTATTATA CAl I 1 1 lATG 
21701 TAGGCTAATT GATGGTTCTG AAAGCCTTAA AAAGCTTACT TTTAGGAGGA 
21751 GAATCATGCC TTGGAGGACT CTAGGGTCCA GAAAAATGTC CTAATACTAG 
21801 AGCTAGGTGC AGTCAGATTA ATTATAATAC ATTTCATTAT TTTGTCTGGA 
21851 ATACCAAGAT GACTTCCAAG CAGGAATGGA GTCTAGCAAC ACTTTACTGA 
21901 TGGGGAACTT GGCCACAGAC TTGTAATACA AAI I I I IGGA TATGTTGACA 
21951 ATGTTTCTCC TTATTTTTCT TACTTATACA AAGCAAGAAA TTTGGCTCAC 
22001 AACCTTGAAA CAGACTTACC AGGTTCCTCC AGTTTCCCAA GCCTCAATAT 
22051 CTCATTGCTA TTTTTAA 
(SEQ ID NO: 3) 

SNPs: 

DNA 

Posi ti on Maj or Mi nor 
165 G A 
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Context: 
DNA 

Position 

165 TTATGGCCT AACCTTTTTMCTTTGAGTTATTTTO 
TTCAGGAGAAAGAAGCAATCCMCAAACM 
TTTTGAATAGGACATTGGMGAAAAATAATAATCAl I I I lACAG 

[G,A] 

TAGATCCCAAAGTCAAGGATCTATGTTCAACCATGTGTG^ 
ATGAGTMCCATCATTMGCAGTTAGCITAGGCCGTMTATGATTCTTGGACTG^ 
CAAAMTACCACAGGCCTTCTGAMGGTTACCCCTTTCT AGCTCCACTATCATCTAATTT 
TATTAAAAAAAAAAAAAMGGAAAMTTTGAGCTTCTAGAGAGTAGG^ 
TATCCCACAGGGCCAAGGMCAAGTTTTMTCT 

226 TTATGGCCTMCCTTTTTMCTTTGAGTTATTTTCM 

TTGAGGAGAAAGAAGCMTCCMCAMCAAAMGATMCCACACT^ 

TTTTGMTAGGACATTGGMGAAAMTMTMTCATTTTTACAGCT 

AGGATCTATGTTCMCCATGTGTGTTCC^CCATCTTCACAATTGA 



FIG. 3-12 



Docket No.: CL001186DIV 
Serial No.: (to be assigned) 
Inventors: Gennady V. MERKULOV et al. 
Title: ISOLATED HUMAN LIPASE PROTEINS, ... 

[A,G] 

TG/VGTA/VCCATCATTAAG^ 

AAAMTACCACAGGCCTTCTC^^ 

ATTAAAAAAAAAAAAAMGCWW\ATTrGAGCTTCTAGA 

ATCCCACAGGGCCAAGGAACAAGTTTTM^^ 
ATTGAMTATATMTAGAMTATTGTAACATTATATATTTTCTA^ 

TTATGQCCTMCCTTTTTMCrrTGAGTTATTTTG\A 
TTGAGGAGAAAGAAGCAATCCMCAMCAAAAAGATAACCACACTGTMT^ 
TTTTGAATAGGACATTGGAAGAAAMTMTMTCATTm 
AG^TCTATGTTCMCCATGTCTGTTCCACCATCrrCACMTTGM 

[T,C] 

MCGkTCATTMQCAGTTAQCTrAGGCCGTMTATGATTCTTGGA 
TACCAC^GGCCTTCTGAAAGGTTACCCCTTTCTAGCTCCACTA^ 
AAAAAAAAAAAMGGAAAAATTTGAGCrrCrAGAGAGTA^ 
AG^GGGCCMGGAACAAGTTTT^ 

AATATATMTAGAMTATTGTAACATTATATATTITCrATATACT 

QTTGAGGAGAAAGAAQCAATCOVNCAAACAAAAAGATAACC^ 

TGTTTTGMTAGGACATTGGMGAAAMTMTMTCATTTTTAC^ 

CAAGGATCTATGTTCAACCATGrGrGTTCCACCATCrrC^ 

ATTAAGCAGTT AGCTTAGGCCGTMTATGATTCTTGGACTGAGATTTCA^ 

GGCCTTCTGAAAGGTTACCCCTTTCT^ 

[A,-] 

AAAMGGAAAMTTTGAGCTTCrAGAGAGrAGGQGCTACCATTTTGW 

AAGGAACAACTTTTMTCTATTCATT^^ 
ATAGAMTATTGTMCATTATATATTTTCTATATACT 
TACAGMTATATTATTAMTATTGTAGAACAATATA^ 
CAGTMTATATTAAATACTTATTAAMTAGCAAGCTTATATA 

GCAGTTAGCrrAGGCCGTMTATGATTCrrGGACTGAGATT^ 
TCTGAMGGlTACCCCTTTCrAGCTCG^CTATCATCTM 

AGGAAAMTTTGAGCrrCTAGAGAGTAQGGGCrACG\TTTTGT ATCCCAG^GGGCCAAGG 

MG\AGTTTTMTCTATTCATTTAMTTMTTTCA 
AMTATTCTMCATTATATATTTTCTATATAaTTTATTATATAGAAM 

[G,T] 

MTATATTATTAMTATTCTAGAACMTA^ 

ATATATTAMTAiCTTA7TAAMTAGO\AGClTATATA 

GAMGTTTCAGCTITATTTCTTTGAG\TTACn^ 

QGMTTGTCCAGATTATTCAMTMCTGGAAGTTGA^ 

AGAAACTCTTTTAAGATTTGAGCT 

AGGCCTTCTGAMQGTTACCCCTTTCTAGCTCCACrA^ 
AAAAAMGGAAAMTTTGAGCTTCTAGAGAGTAGGGGCTACCAT^ 
CCAAGGAACAAGTTTT AATCTATTCATTTAAATTAATTTCAGT^^ 
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TMTAGAMTATTCTMCATTATATAT 

ATTA(^GMTATATTATTAMTATTGTAGAACMTATATMTACAGAAAMTATATAATA 
[C,T] 

TCAGTMTATATTAMTAOTATTAAMTAGCMGCTW 

TTGTGAGAMGTTTCAGCTTTA I I ICI I IGACATTACI I IGI 1 1 CTGCACAAACAAAAGA 

ATTACAGGMTTGTCCAGATTATTOWtf^^ 

TGATGT AGAMCTCTTTTMGATTTGAGCTAGCCTACAATCT 

GAACTATAmGTGCTATTTCCATATTMGTCAAGGCMC^ 

1621 CGGCTTMGCTCCACAGGCATACAMGTGMG(^^ 

TATCTGGTATCTCATGTGGGGCTTAGAGGT AAATTGTCGTT ATTTGGCCrCCATTTCTGC 

CTTT AACCACTGGTGT AMCAMGGTTACTGTGCCAMGTTGACAGGV^CCCAMTCCCT 

TTGGCATGTGMTTAGTTTCCTCTGCCA^ 

GATTTAGGAGTCAGGGTTGCCTCAT^ 

[A,G] 

CACTGCATGGTTGGC^CTAGTTCCrTGATATATGTT ACTCCGTTTGATCCTCATGAAGGA 
t: TCAMTGGQGAAGGGAGATACTATTGTCTCTGATTGTCCATTMG^ 
g TA(XTCCCTGTTTGACACACrGGTTTGAAMTGTTGCTM 
g TACTCAGTGGAMCATGMGGATTCCGT^ 
2 ATTTCCCMCCTGCMGTGCATCATG^ 

O 2330 AAMGTTG^GMGTTCCTG^TOVVrAAGGAGT CCTTGTGAGC^GGTGA^GCTCATCrAAC 
m TAGGTAAGATGAAGATCTATrATAAO^GGAG^ 

CAGTCAGGTGCMGAGCTCTGCAGTGAGGCT 
^ T(XTGGCTCTGTGACOTGAGCAGGTCrTAMTCT 
^ ATAAAATGAGGATMTMTAGTACCAAMTTAGGGAGATm 
~ [C,T] 

jij GTGAACT ATTTAGAGT AATGCCTGCCATMGGGGACTCAGTAGC^ 
rf ACMTTTGAAMGmCATMTATTTGCAGAW 

TGTATGCAMGCTATTTAGCTTCAGM 

I I IGI I ATAGTGMTTATCTGTMTATTTATCT 

GWvGCATTTATTMGMCrTACACrGCACTAM AATCCTCACT A 

2498 AGATCTCTCACCTCTTGGCTCrGTGACCrrGAGCAQ 

TTTTTTTMTTGATAAMTGAGGATMTMTAGTACCAAMTTAG^ 

TTAAATMCATACGTGMOATTTAGAGTMTGCCTGCCATMGa 

TTATTAGrrTCATACMTTTGAAMGTTTCATMTATTTGCAGA^ 

ACCAGATAGCTMTGTATGCAMGCTATTrAGCrrCAGM 

[A,G] 

GTTAAATATTAC I I IGI I ATAGTGAATTATCTGTAATATTTATCTCTTGCrCAC I I I IAT 
MGAAAMTAGTGAMGCATTTATTMGMCTTACACTGCACTA^ 
TMTCCTCACTATMCCCTATGAGATAGGTTACATTATTGTC 
AMCCAAGAGACAAAGCT ACT AAMCACTTGCCTGAGGTTAGACATC I ICI ICTGTGGTG 
AGGCTGGATTTCAMTTTAGACCAT1TGACT 
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2791 TTCTAGAAGTT AAATATTAC 1 1 IGI I ATAGTGMTTATCTGrMTATTTATGrCTTGCTC 
ACTTTTATMGAAAMTAGTGAMGCATTT^^ 
ATATGACrrMTCCTCACTATMCCCTATGAGATAGGTTACATTAT^ 
TMCMGGAMCCMGAGACAMGCTACTAAM 1 1 CI I 

CTCTOJrGAGGCTQGATTTOW^TTTAGACC^m 
[T,C] 

GCTGTTT AGTGTTATAGTGTTGGTCT ACCTTrGMTAGACATACrrTTAAACCATGGO\A 

GGMGTGAGACTGCACATTGAMTATGTAAMTTTGCCTT^ 

GT(>CATOVCTAGAMCJMTCATMGCTTTTGrGTT^ 

I I I I LI IGI I I ACrrTGTGGGATACTGGGCTTAACTAGGGGATACCTCCAL I I I I I ACTT 
GGCGVTGGTATGAAMCCrGTCCrcrGMTCTTT AGATATTTTGGCAAATTGT AGGCAAA 

2877 ATTTATTMGMCrTACACTGCACTAAAT^ 
TATGAGATAGGTTACATTATTGTCCT^ 

ACTAAMCACTTGCCTGAGGTTAGACATLI I CI I CTGTGGTGAGGCTGGATTTCAAATTT 
AGACCATTTGACTGTAGCACTTATAT^ 

TACCnTGMTAGACATACnTT AMCC^TGGCMGGMGTGAGACTGCACATTGAAATA 
[T,C] 

GTAAMTTTGCCTTTQGGTGCCACCTGAGAMTAGTCACAT 

GGTTTGTGTTTGGTTAMGrrTTTATTGATCCA 1 1 1 1 1 CI I (a 1 1 I ACTTTGTGGGATACT 
GGGCTTAACTAGGGGATACCTCCACI 1 1 1 I ACTTGGCCATGGTATGAAAACCrGTCCTCT 
GMTCTTTAGATATTTTGGCAMTTCT AMGCAATTCAACCTTG 
ATTAAMTMGACGVW\ATGCCTCCATACrrGATTAM 

2879 TTATTMGMCTTACACTGCACTAMTGTTATA^ 
TGAGATAGGTTA(^TTATTGTCCTMT^ 

TAAMCACTTGCCTGAGGTTAGACATC I ICI I CTGTGCTGAGGCTGGATTTCAMTTTAG 
ACCATTTGACTGT AGCACTT ATATGATGAGC^TGCTGTTTAGTGTTATAGTGTTGGTC^ 
CCTTTGAATAGACATACTTTTAMC(^TGGCMGGMGTGAGACT 
[T,C] 

AAMTTTGCCTTTGGGTGCCACGTGAGAMTAGTCACAT^ 

TTTTGTGTTTGGTTAMGTTTTATTGATCCA I I I I ICI IGI I I ACTTTGTGGGATACTGG 
GCTTMCTAQGGGATACCTCCAC 1 1 1 1 1 ACTTGGCCATGGT ATGAAAACCTGTCCTCTGA 
ATCTTTAGATATTTTGGCAMTTGTA^ 
TAAMTMGACCAAAMTGCCTCCATA^ 

2912 TATGACTTMTCCrCACTATMCCCTA^ 

MCAAGGAMCCMGAGACAMGCTACTAAMC^ AGACATC I ICI IC 

TGTGGTGAGGCTGGATTTCAMTTTAGACCATTTGACT 

GCTGTTTAGTGTTATAGTGTTGGTCTACC^ 

GGMGTGAGACTGCACATTGAMTATGTA^ 

[A,G] 

TCA(IATCACTAGAMCTMTCAT^ AAAGTTTT ATTGATCCATT 

I I ICI IGI 1 1 ACTTTGTGGG^TACrGXjGCTT AACTAGGGGATACCTCO^CTTTTTACTTG 
GCCATGGTATGAAMCCTGTCCTCTGAATCTTTAGATATTTTG^ 
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AMGACrrAMGCMTTCMCCTTGATTAAMTMGACGW 
TAAATTTATTTCATTTTAGGMCTGGATTATMTCAAGAC^ 

3076 CrrATATGATG^GCATGCTGTTTAGTGTTATAGTGrrGGTCT 

TTTTAMCCATGGC\AGGAAGTGAGACTGCAG\TTGAAATATGT AAAATTTGCCTTTGGG 
TGCCACGTGAGAAATAGTCACATCACT AGAMCTMTCATMGCTTTTGTGTTTGGTTAA 
AGTTTTATTGATCCA 1 1 1 I IU IGI I 1 ACTTTGTGGGATACTGQGCTTMCTAGGGGATA 
CCTCCACI 1 1 1 I ACTTGGCCATGGrATGAAMCCTGTCCTCTGMTCrrrAGATATTTTG 
[G,T] 

CAMTTGTAGGOV\AOW\GAClTAMG^^ 
TGXXTCCATACTTGATTAMTTTATTT^ 

TCTACATGAAAAMTAGATTMTAGTGOT 1 1 1 I 

ATAGVTTATCTGCCTTCGGTGTTATTCAAGriTT^ 

^TTTTATTTGkTrMTCAACATTGATAGTrAAMTTMTCTCT 

u 3745 TGOTGGATTCCTTGATTTGGAAMTGMGTGMTCCTGA 
h TCATGGAAMCTGTGMGMGCTCAMTAMG^ 

S AGGTCCTGTTGTMCAGAAMTCrcrGATAAMGVGATAAMTGTAGATQj I I I I IAACC 

D TCTGCMGACTCM(KXAGTTA(^T^ 

yj MCCAMTTGTGCTATTGTGCTATCTATCTATCrA^ 

W [C,G] 

O TATCrATCTATCTATTTATCTATCTATCT 

™ TTTAAGAATATCAAGCTA I I IGI I GATATACATGATTGCCTTCrATTGATCTATAGTTCT 

: ATTACrrrTAMGCMGAGGGGTCTGW\AGAO\ATTGA 

GAAAGMTGGGTCMTGCTAMTTTTCCCCCAACCCCCCAAMTATTAGC 
" TAI I I 1 1 lAAMTTCTACrTATTTTGTATTMGACTTTATTTATTMTTTT 

ff! 

5 3752 TTCCTTGATTTGGAAMTGMGTGMTCCTGAGGTGTGGATGM 
H> AMCTCTGMGMCATCAMTAMGCAGGACTMTGGAGTATGA 

GTTGTMCAGAAMTCTCrGATAAMCAGATAAAATGTAGATGG 1 1 I I IAACCTCTGCAA 

GAGTCMGCTAGTTAGATCrrTGTCTGAAAMCAMTACT 

TTGTGCTATTCTGCTATCTATCTATCrATCrATCT ATCTATCT ATCTATCTATCrATCr A 
CT,-] 

CTATCrATTTATCTATCrATCTATAGATAGMCCrCCT 

ATATCAAGCTAI I IGI I GATATA(^TGATTGCCTTCTATrGATCTATAGTTCrATTACTT 
TTAMGCMGAGGGGTCTCAAMGACMTTGACTTGATMTATAGCrrTGTC^ 
TGGCTCMTGCTAMTTTTCCCC^ I 1 1 I 

TAAMTTCTACTTATTTTCTATTMGA(TrrATTTATTMT^ 



3762 TGGAAMTGMGTGMTCCrGAGGTGTGGATGAATACTGTMGT(^TGGAAMCT 
GMCATOWVTAMGCAGX^aAATGX^^ 

AAAATCrcrGATAAAACAGATAAAATGTAGATGG I I 1 1 I AACCTCTGCAAGAGTCAAGCT 
AGTTAGATCTITGTCrGAAAMCAMTACTGTCCGGT AATGAAMCCAAATTGTGCTATT 
GTGXTATCTATCTATCTATCTATCTATCTATCT 
[~,C,T] 
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ATCTATCTATCTATAGATAGMCCTCCT 

ATTTGTTGATATACATGATTGCCrrCTATTGAT^ 

AGGGGTCTCAAMGACMTTGACTTGATMTA^ 

CTAMTTTTCCCCCMCCCCCCAAMTATTAGCCMTAGTAGATA I 1 1 I I IAAAATTCTA 
CTTATTTTGTATTAAGACTTTATTTATT^ 

3833 AAAGCAGGACT MTQGAOTAT^GGTTACGAMGGTCCTGTTGTMCAGAAAATCTCTGA 
TAAMCAQVrAAMTGTAGATGGTTrTTMCCTCT 

TGTCTGAAAMCAMTACTGTCCGCTMTGAAMCCAMTTGTGCTATTGTGCrATCrAT 

CTATCTATCTATCTATCTATCTATCTAT^ 
CTATAGATAGMCCTCCTCnTrGMTTTATGTTTTM 

[A,G] 

TACATGATTGCCTTCTATTGATCXATAG^ 
MGACAATTGACTrGATMTATAGCTTTGrCAGAAAGAA^ 
CCCAACX^CCCCAAMTATTAGCCAATAGTAGATATTTTTTA 
TTMGACTITATTTATTMTTTTAG^GTTACCT 

CTMTAAGCACACAACAGATC bl I IG I II 1 GATTCCTTTTT ATATCCTTTGGAGAAGTTC 

4399 GTTTTGATTCCTTTTTATATCCTTTQGAGAAG 

CAGAGTGAMTCATCATCTAG\ATGGCTACCCCAGTGAA 

GATGQGTATATACTCCTTGTCAACAGMTTCCTTATGQGCGA 

GGTAGW^TATGTCTCrCCTGAAMGGGGACTGCATTGACCTCCTO 

ATTTMTGCTAGATATGCATCM 

[T,C] 

CITTAMTAGTTATCAGGGAGGCTCACTCTTTGCCTGATM 

mcctaaamtagw^cagcaagactgatctrgctmctgcm ag 
ggtgtamcagaaaggcagagcctgcattttgtc^cctca™ 

AMTTGOTTGTCC(^GGAAMTGGATCCTCTCATTGTCAGMGGA 
TATGAMTTGACTCrGGGGCACCCMGMGMCCTCTCCTGCTCCCACTAAMTTM 

4945 AATTGACTCTGGGGCAGCCAAGAAGAACCTC 

CCTCTGCAGGATAAAAMCMTCTAGTTAMTGACMCGCATITCT 
GACTGAAAACCTT AACATCCACATACACTTTGATCTMGGGACAGACGGITCATAGA^ 
AMGAGTATGGTGTCAATMGGCTTGMTTCTAGMTGAGGAGCCAG^ 
GGGGMTGATACTCCTTAAMQGGAAMTTTMCTACAMTCCTCTGMGTA 

[A,G] 

AGMTMCCAAMTATCTGCMTGGTT(^TAGCAMTMTTTATTGG(^G(TG^ ACC 

GTGTTCATTTTGCATCTTTTTTCCCACG^CAC^ 

GACATTCTCTCCCTCITTTATCTCC^GTTTCAGMTG 

AGTTTTAlGTVGTTAAAATATGAAAlCACCCAGTT^ 

TMTTTTGTATAAGTCTCATTTTAAGATM 

5056 GTTTTCCAGGACTGAAAACCTTMCATCCACATACAGTTGA 

CATAGAATGAMGAGTATQGTGTCMTMGGCTTGMTTCTAGAATGAG 
GCCATAGCAGGGGMTGATACTCCTTAAMGGGAAMTTTMCTACAM 
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AGAMTGATMGMTMCCAAMTATCTGC^ 

GCTGaTACCCTGTTCATTTTQCATLI I 1 1 1 I CCCACCACACATATTAAGGAGCAGCTGA 
[A,G] 

GTCATGTTTGACATTCTCTCC^ 

GATATGAGTAGTTTTACTAGTTAAMTATGA^ 

MCMOW^TMTTTTGTATMGTCTCATTTTMGATMTACT 

TCACT ATTATCACT ATTTATAAMTTTTGrAGAGGVTCCrGGATL I 1 1 1 IGCTTACTTTT 

Gl 1 1 1 IAI 1 1 I I IGCTAMTCTGGCMTCCCAGG^CATGTOTGMC^GCrGrGAAATA 

5280 AMTMTTTATTGGCAGCTGCrTACCGTGTTCAT^ I 1 1 1 1 1 CCCACCACACAT 

ATTMGGAGCAGCTGMGTCATGTTTG^ 
ATGAAAMTGAGAGTGAGATATGAGTAGTTTTACTAGTTAAMW 
MTTTGAAGGTCAGATAMCMCAMTMTTTTGTATMCT 
AAAMGTCATTATTTATTCACTATTATO 
[T,A] 

u, CI 1 1 I IGCTTACI I I IGI 1 1 1 IAI 1 1 1 1 I GCTAMTCTGGCAATCCCAGGCACATGTGTG 

q MGGAGCTGTGAMTATAAMGGAGAAMCTTTT ATGGGAMGATTTGGCTT AAGGAGAG 

Q ATMTTTTGGAMG^TTrAGMTTAMGATCATTCATTAGATGTM 

6 TATATCAGTTAAACTTCTCATCM^ 

yj ACAMTGATGAM7TMGGCACMCCGGTTATC 

O 5790 TGAGATGGGTACCACTMTAGTCACCATTTCACAMTGATG^ 
TATGTTMGAGGCCTAMGTCCACAMTAGCMGCTGAC^ 
CATGCTGGCTCCAGAGCCTGTGCTCTTAGTCA^ 
CACCCTGGTTACTTTGGATCTCCCTGAATGCrCTCT 

GCAGAGGGACACTGAGCTGAGCATATTATTGTAG 1 1 1 1 I AAATGCTCTCCACTGGACAGA 
[A,G] 

GATGGGGGATTTGMTAGAMTTTGGTGAGGAACT AATCAGTGTCCATTTACACTCACCT 
CCTCITCCTCCCTGGAAGAGCTATAGGACTTGAGTMG^ 
TAMCCACACCCAGGAMTTTGTATATACAMTACATAGAGCACAG^^ 
GAaTTGACATAAAMGMCrGGGTTTGAGTCCCTGCTCTGGCL I I CI I ATCTGGGTGGC 
CCTCTGGGAAAGTTACTTMCTACATAAAGI I I IGI I I CCATATCTACAAAATGAGGTTT 

5901 MGCCCAGGCATGCTGGCTCCAGAGCCTGTGCrClTAGr^ 
TTGACCTTCCACCCTGGTTACTTTGGATCTCCCTGMTG 

TGGMGTTGGCAGAGGGACACTGAGCTGAGCATA-rTATTGTAGI I I I I AAATGCTCTCCA 
CTGGACAGMGATGGGGGATTTGMTAGAMTTTGGTGAGGMCTM 
ACACTCACCTCCTCTTCCTCCCTGGAAGAGCr ATAGGACTTGAGT AAGCATGATAAATTT 
[C,T] 

GTGTCTTTGTAMCCACACCCAGGAMTTTGTATATACAMTAC^^ 
ATCAGGA(^GACTTTGACATAAAMGMCTGGGTTTGAGTCCCTGCT I CI I AT 

CTGGGTGGCCCTCrGGGAMGTTACTTAACTACATAAAG 1 1 1 IGI I I CCATATCTACAAA 
ATGAGGTTTCTCAAMTAGCAGCTAGTTT ATAGAG I IGI I GCAAGAATTTAGT AAGCTAA 
TACATATAMTACGTCMCATAGCACCAGGTACAAAMTATGT 
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6457 CMCATACCACCAGCTAOWW 

CTCTATACTATTGACAAGGGAAAAGTGAAAACAG I 1 1 I Id 1 1 1 1 ACCATGTGTGTATGTG 
TGTGTGTCTGTGATGTTTCCGACATGCrCTATTTMCATAAATTACrCT 
TCTCTCTCTTTCrCTTTCTCCCrcrCTCATC 
AGTTGTGTATATGCAGCATGCCCTGTTTGG^GAG\AT(^ 

[C,T] 

MTGGMGCCTTGGATTCCTTCTAGCA^ 
CGGGGAMCACTTGGTCM^ 

GCCTTTAGGT AMTATTAGCTMGAAMCTCAAGGGGGAM 

MTMCCTGGACGCTATTMTGATTAT^ 

TTTCTGTITMGATCTCAMQGAQGGrMCAGCMGAA 

TTCTCTCTCTCrCTTTCTCTlTCTCCCT 
CGGCG^GTTGTGTATATGCAGC^TGCCCTGTTTGCAGACAATG^ 
TATGCCMTGGMGCCTTGGATTCCTTCTAGCAGATGCAGGTTATGATCT 

AACAGTCGGGGAAACACTTGGTCA^ 
TTCTGGGCCTTTAGCnAAATATTAGCTA^ 
CT,A] 

AAAAAMTMCGTGGACGCTATTMTGATTATCmGACGCrrGMCTG\TATAG(TC^ 
TCTAGTTTCTGrrA/VGATCTCAW 

TTCTCCCACMGCAMGTATGGCATTTCMCMGATCATTTTTACA^ 

TTCTATGCATTAAAAGTATGTTX^ 
AGkTTCATTCAGCCAATGTITTACTGAGTGGCTACT 

6763 AAGCCTTGG^TTCCrrcrAGCAGATGCAGGrrATGAT^ 
AMCACTTGGTCM(W^GACACAAM 
TAGCTAMTATTAGCTAAGAAAACTCAAGG^^ 
CGTGGACGCTATTMTGATTATCTTTGA 

GTTAAGATCrCAAAGGAGGGTAACAGCAAGAAGCrCTGA I I I I I CACTGATTCTCCCACA 

[A,G] 

GGV\AGTATGGCATTTCAAG\AGATGVrTTTTACATCCM 

AAMGTATGTCCAM(^GACAGCT<^GGAMTTATCATGACCMTOT 

GCCAATCTTTACTGACTGGCTACT 

GMCAGACAAACTCTGACCTCACAMGCTTA^ 

ATTGCTCCTGGATTGCCMTCMCrGTGTAMGATGATTTGGACCAGGACCl^ 

6955 TMTGATTATCTTTGACGClTGMCTCATATAGCTCCrrGTAGTTTCT 
MGGAGGGTAACAGCAAGAAGCTCXGATTTTTC^ 
CATTTCMCAAGATCATTTTTACATCC^ 
CAMGAGACAGCTCAGGAMTTATCATGACCMTGTGCACAT^ 
CTGAGTGGCTACTGTATGCGCTGTTCTAGGCCCCGMCATT 

C-,T,C] 

TCTGACCTCACAMGCTTATGTTCAT^ 

TTGCCAATCA^CTGTGTAMGATGATTTGG^CCAGGACCTT ATTGATTTAGAGAAACTGT 
GATTGATTTAGAGAMCTGAGATCGCACATAGTACCATTTT<^ 
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GATTTTTAAMCCTTGTTAATGGGCAATGAAGAAGAATCI I I I I IGATATU Ibl I ICI I 
TTMTGGMGAGTTTTCTGCTGTCACO^^ 

7017 GGAGGGTMCAGCAAGMGCTCTGATTTTTCACTGATTCT 

TTTCM(^GATCATT1TTACATCCMTTCTCTGMTTCTATGCAT^ 

MGAGACAGCTCAGGAMTTATCATGACCW 

GAGTGGCTACTGTATGCGCTGTTCTAGGCCC^^ 

CTGACCTCACAMGCTTATGTT^^ 

CT,G] 

GCGV^TCMCTGTGTAMGATGATTTGGACCAGGACCTTATTGA 
TTGATTTAGAGAMCTGAGATCGGVCATAGTACCATT^ 

TTTTTAAMCCrTCTTMTGGGCM I I (j I I I L I I I I 

MTGGMGAGTTTTCTGCTCTGACCAC^GGACAGGCTGATGCCrGCGATAGALI I I IU I 
TCTTCAGGCCTMGCTCCCrGTTGGT^ 

= y 7151 GAMTTATCATGACCAATGTGCACATTCATTCAGCCM 

O TGCGCTGTTCTAGGCCCCGMCATTCAMC^ 

2 CTTATGTTCATTTTAGTGATMTTTTAC^ 

H GTAMGATGATTTGGACCAGGACCTTATTG^ 

JJJ ACTGAGATCGCACATAGTACCATTTTG^^ 

S [G,T] 

m TTAATGGGCMTGMGMGMTCTTTTTTGATATCI IGI I I <_ 1 1 1 I AATGGAAGAGTTTT 
CTGCTGTCACCAGAGGACAQGCTGATGCCTGCGATAGACI 1 1 1 LI 1 1 CTTCAGGCCTAAG 

N= CTCCCTGTTGGTTTGTAMCCTGATGCTAGMCAGACTCT 

Fy AM(^TTCAGTACCCACTGAMGTTTG^ 

O TCTGAGTTCTTGGGCAGGGGCMGCATC^^ 

5 7308 CTCCTGGATTGCCMTCMCTGTCT 

GAAACTGTGATTGATTTAGAGAMCTGAGATCGCACATACT 

0\ATATTAGATTTTTAAMCCTTGTTAAT^ I I I I I I GATATCT 

T GTTTCm I M TGGAAGAGTTTTCTGCTGTCACCAGAGGACAGGCT 

GA L I I I I L I I I L I IC AGGCCTMGCTCCCTGTTGGTTTGTAAACCrGATGCTAGAACAGA 

[C,G] 

TGTGTATTCCTATTACATTMTAAMCATTCAGTACCCACTGAM 
AGGAATAGMTAGAATGTTATAGTCTGAGTTCrrGGGCAGGGGCM 

TGMTCATTACTCTTTAGGAGCTCTCACMC 
ATAGATTTCCTCACATGTTCTTTTMTAMCAGGCTTCTAO 

GACT AAATGTTATATAGGCCL I I I IGI ICCTCCTGTCTGAAGAACAAAATALTAGTACTA 

7321 MTCAACTGTGTAAAGATGATTTGGACCAGGACCTT ATTGATTTAGAGAAACTGTGATTG 
ATTTAGAGAAACTGAGATCGCACATAGTACCATTTTCAGGAAM 
TTAAMCCITGTTMTGGGCWGMGMGAATClTmTGATATL I IGI I I LI I I IAAT 
GGMGAGTTTTCTGCTGTCACCAGAGGACAGGCTGATGCCTGCGATAGAL l l l l Li I iCT 
TCAGGCCrMGCrCCCTGTTGGTTTGTAMCCTGATGCTAGMCAGACTGTGTA 

[T,C] 
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TACATTMTAAAACATTCAGTAGCC^ 

MTGTTATAGTCTGAGTTCTTGGGCAGQGQCMGCArCA 

TTTAGGAGGTGrCACMCMTTCTCCrATTCTTGTMGrCCCMTCrATAGAT^ 

CATGTrCTTTTMTAMO^GGGrTCTAGGrrATGGM 

ATAGGCCCrTTTGTTCCTCCTGra"(W\(W\CAAAATACTAGr ACTATGGAATATTGGTA 

GCGATAGA LI II III I I C I I CAGGCCTAAGCTCCCTGTTGGnTGrAAACCTGATGCrAG 

MGKGACTGTGTATTCCTATTACATTMTAAMCXIT 

ATACTC^GGMTAGAATAGM^ 

GAMTATTGAATCATTAGTCTTTAGGAGGTGrCAC^ 

CCAATCTATAGATTTCCTCACATGrTCTTTTMTAMC^^ 

[C,T] 

TGATTTGACTAMTGTTATATAGGCCCTTTTGTTCCTCCTGTC^ 
GrACrATGGMTATTGGrATATATTAMTATATATCTATATATCGa-GrGGACAGGAATA 

CTACT ACTMCMCATCTTACTGAGCAOT I I PCI I iCATACT 

ATTAMCCCCGTTAGCAGCCCCGTAMCCAGGrACTACCCTGTTTATrrCCCAAATGAGA 

AAACATAGGCTCAGAGCATTTCAGrMTTTCTCMGAGTTGG 

ATAAMCTGCTCAGGAGAAATTCTATTTQ 
TGTTTATGAGGGTCACTGTTAGG^^ 
GAGTTCACmTATGTTGGMTAAMCMCTGTTACrrATAC^ 
CTCnOCTCQGMTMCCCTACTACTCrAAGTAGCT 
TGTAGGGCAMCCXITCCTGGGTCTCT^^ 
[T,C] 

TTCCCAGGMTAACATGTGTTCCAMTTC^ 
ATTCCCTCTGAGCTGAAAAAGTAAAATTCMTGCCATGGMTAT^ 
ATGTGCAT^TCATCTCriTCrCACAACCGWkTGGGAl I I I I AAAAAATAAAAGGGAA 
GGGCTTATACCrATATTTAMCAMTTGAAMGGGVTGGTTATA I ITGTI iGTGAGTTGG 
AAOVCACAAGCTTACTATAATAAATCAATTGAGCTTATCT 

TMGTAGCTGTGAGCCrGCAGTGCACAGACrATATGTAGGGCAA^ 

TGGTCACAGCAGCATATTGACTACGGTGATGC^ 
ATTCAMGAAATMTTCCACAGAGTMGTTTCTAGATTCCCrCr 

ATTCAATGCCATOWVTATGGCTGAAAW 
CAACCCAMTOiGGATTTTTAAAAMTAAMGGGMGG 

[C,T] 

TGAAAAGGCATQGTTATATTTGTTTGTGAGTTGGMCACAC^ 
MTTGAGiCTTATCTATTCAGTGTGT^ 

(^CTATCTAGAMTTTCTAMGTTTTrrMGCrGACMCTTAC I ICI I AATTTACTTACT 

TTACrrMTTTACrrTACAATTTACrTTCCAGGTATT^ 

TTCCMGTAAMGTTGAMGGMCCCACACTMTA^ 

AAATGTGCATCMT^TCTCriTCJCACMCCCAAATGGGAl I I I I AAAAAATAAAAGGG 
MGGCrrTATACCTATATTTAMCAMTTGAAAAGGCATGGTTATA I ITGT1 iGTGAGTT 
GGMCACACMGCTTACTATMT^ 
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TATTTATGAMTAGCAAGTAAATGTAAGCACTATGT AGAAATTTCTAAAG 1 1 I 1 1 lAAGC 
TGACMCTTACn'CrrMTTTACTTACrTTACTTMTTTA 

[G,A] 

TATTTTGGAMGAMTCMTMTCTAGTTCCMGTAAMGTTGAM 

TAAMGCITTGAATTTCTC^^ 
CATGTGAMGTGGVVTATTTCAGTTTAGGGAMT^^ 
TMCAMCATATATTCATTACTAT^ 
CACTTAGCATCAGTCAGCATAT^ 

GTTTCATTTAGGACATAMTATTTTTACT 
TCTGTTATGTMGGAGCACCCACrcriTGTAQGACA^ 
CAGGGCTCTGCAGTCAGCGTGACCC^^ 
TCTGGGGMGTACTATTCCTGATT<^GACT 
AGrrcrrcrrTTTCTGGCCMGACAGTTTTMTG^ 
[T,C] 

A(^(^CACAG^CACA(^CA^CACACA<^(^CAG\CATGCr AGTGGAGGCCCAGGAAQGG 
ACCrCTGGAMCCAMTrATATGGATATTCTCCCTAGCCTACCCAGTGrrGTGCrM 
CCATCCTCACAGATATACAMGGGGTGC^ 

GATGCCrGGTCCTTACTGGGCG^TCGTGGATGCTAGGGAAAGCCCL I I ICI l I I iGGAAA 
CAGGGMGAGTCTAGAGGGTTGAAAAACACC^ 

CATTTTGGACAGAGCMTTTCTGTTAT^ 
TAGGTCCCAGCCCATTAMCAGGGCTCT^^ 

CACACATTTCCAAACACCCTCTGGGG I l I I lATCAA 

TTGTTCAGTCAATTATTTCA Cj I I L I I L I I I I I CrGGCCAAGACAGTTTTAATGTTCCAAC 
MGTGTTTCAGTACACACATACACAC^^ 
[C,T] 

AGTGGAGGCCCAGGAAGGGACCTCTGGA^ 
CCCAGTGTTGTGCTMTCTCCATC^ 

AAAGAGCAAAGCAMTGGAGATGCCTGGTCCrTACTGGGCCATC 
GCCC LI I IL I I I I I GGAMCAGGGMGAGTCTAGAGGGrrGAAAAACACCCAGTAAGACA, 
CTGGGAGCAGTGAMTTTCATTCC^^^ 

10363 AGCCTACCQVCalXTITGTCKTM 

CTGCTGAMGAGCAAAGCAAATGGAGATGCCTGG^ 

GGGAAAGCCC LI I I LI I I I I GGAAACAGGGAAGAGTCTAGAGGGTTGAAAAACACCCACT 
MGACACTGQGAGCAGTGAMTTTCATTCCATAGTGAGAM 
TGGGTGATGCTGCAGAMGAMTCAATTCACCrCCTGTG^ 
[G,A] 

CTCTGTGATTCATT(TGGCATCTG\GAGTTAGGGATC 
ACCCCATGCTTGGGAAGTTTACACAGCAGTAGCTACT 
CCCOXKlGNACrACTCCAT^ 

AAMTTGGAGACTTGAGAGCAGAGAAGACTGMGGCAGATTA^ 
GMGACTTCCMTTCATCCCCAGTATG^ 
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10684 TCTCAGAGTTAGGGATGAAATGAGMTGTTGC^ 
ACACAGCAGTAGCTACTCCAGCAGCTO 

TCCCCCMTCMGTCAMCrGTCCATAMTAGMTAAMTAAMTTGGAGAaTG^ 

AGAGAAGACTGAAGGCAGATTATCTTTATAGAATAACT^ 

CAGTATGATCACGATAGMGGAAAAAATGACTMGCAG^ 

[T,C] 

TGCGTMGTATTTATTTTTACAAGATTGTCTTA 
TTTTCCACCATGCCTGMCTGGCACAMGMT^ 
ATCTCATTCAAATATCCCACGGGCATTTTTACC^ 
ATCMGGTAGGCTCCTTTCMCAAMTCTACCTGAGGATCTC^ 
TTATTATTTTCAAATCT ACTGTAMGTAAMCTAGGAMTTTAGATAAMTCTATAGAAC 

TCCTTTCMCAAAATGTACCTGAGGATCTCATT^ 
MTCTACTCTAMCTAAMCTAGGAMmAGATAAM 
GGTATGTGCTTGTGTATGTGTGTCCCTGCGTGTGCGCATGTCT 

GGTTCTGTMTACMTTTACTATA^ 

CTAGCTGAACTGAGTGCTATATGACAACAAGGA 1 1 1 1 ILI IU 1 1 ICCCAAGTbi mil 
[G,T] 

TTCCATTTAGTGVGGTAGGTCMTGMTTCACATTGCCCAM 
ACCCATMTCAa"GATGTGTCCMTTTTGACATTAGAAAMCCTGAT^ 
CCAATATGGAMCTTGCCCTMTMCXAAAGCTAA^ 
GCTCMGTATTMTTCAMTATTTAT^ 

TTGCCMTTGTGGATTTGGGATTTTATCTATTAMGGG I I I I II I 1 1 1 1 1 1 I CTCTTTGC 

TTTMGTCCCATATCCTGCTCTrTTCTTCCGrCAG^ 
ACCAGGAATCCCCATCCMGTTTACTTTCCCMCTCCTGGMGT^ 
TTGTGACATTATCATATCTITTCTGrrCAATGGTTGC^ 
ACTTTTCAGCCTGAGAGCTGGCrMTCTGGGACAGTACT 
TMCATGGAAMCCCCGATTTTCCC^ 
[T,C] 

GTTTTACATTTCATACCMTTMTGAGAAAAAAATATTGGG 
TACAGGGMGCrTCACTATGGAGMGT^ 
CTTGTAMTMTATTTGATACTCTTCCTCATCTGG 
TGMTMTTTCGTCTCCTTGACreMTCAGTAAGrACA 

TTTCCTAGMTGAMGAMTGTCMGMGTCTGMGATGATTC1TGM I Mill 

12349 AGTCCCATATCCTGCTCrriTGTCCGTCAGTTTCCCCCAGM 
GGAATCCCCATCCAAGTTTACTTTCCCAACTC^ 
GACATTATCATATCTTTTCTGTTCAAT<5GTTGCTrCT 
TTC/kGCCTGAGAGCTCGCTAATCTGGGACAGTACrCGM 
ATGGAAMCCCCGATTTTCCCTTATATTCMGGTATTATTTGACOT 

[C,T] 

TACATTTCATACCMTTMTGAGAAAAA^ 
GGGMGCTTCACTATGGAGMGTGMTTTGGGATTGA 
TAMTMTATTTGATACTCTTCCrCATCrGGAGACAGVrrCCT 
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TMTTTGCTCTCCrTGACTGMTCAGTMCT 

CTACW\TGAMGAMTCTCMGMGTCTGAAGATGATTCrrGMTmGGI I I I I IGCTA 

13115 TAGMGATMGAAMCGAAGATAGCXTCTACCAAMTC 
TGATATOTAGCGMTTTATGTCCrrATGQGC^^ 

TATGT ATGATAATTATAGGGCG^TTTGATACCTT MGAMTTCCAGCTTTCCTTTGACTC 

ATTTTGATATATCTATTTACTOTATAMTTCATA^ 

TTTTTTTTTGCTITTAAAMTGTTTATGGGTATAT 

[C,T] 

ATATATTTTGATATMCO\TACMTGTGW 

TCACCTCAAGCATTTATCA i MCI 1 1 MG I 1 AGAGACATTCTMTTTGACTCTrCTAGTT 
A7TTTGAMTATACMTGMTTATTGTTM 
TTAGTCCTTCTMCGGTATTTTGGTACCCATTMCC^ 
CCCTACTACCTTTCCCAGCCTCTGGrMCCATCATTCTTCT 

13354 Al I I I I I I I I GCTTTTAAAAATGrrTT ATGGGTATATAATAGTTGT ACATATTTATGAGAC 
ACATATATTTTGATATMGCATACMTGTGTMTGACCAM 

CATCACCTCMGCATTTATCA I I I < I 1 1 I PG1 I AGAGACATTCTMTTTGACTCTTCTAG 

TTATTTrGAAATATACMTGMTTATTGTTMCTATAGTCATCCTATr 

CTTTAGTCCrrCTMCGGTATTTTGGTACCCATTMCCMTGCCT 

[T,A] 

cccctactacctttccg^gcctctggt aaccatcattcttctcact atctctataaggtc 
ag 1 1 1 1 1 1 1 1 1 amctcccctatatgagtgagaacatgcagtatttgtc. 1 1 i i igtgcct 
ggcrratttcacttmtgtmtgttctctmtttcatccacattattgc^ 
tttcattci i li i atggctgtctatatgtaccacattttatttatccactcatctgttga 
tggagacttaggctgatttcatatcttggtcattgtgmtagtg 

13373 mtgtttatgggtatatmtagttgtacatat™ 

catacmtgtgtmtgaccamtcagggtmttgggatatcca^ 
caiiiliiiiigi i agagacattctmtttg^ctcttct 
gmttattgttmctatagtcatcctattgtgcatgccagacmagtccttctm 
attttggtacccattmccmtgcctcrrtatccttccccg^cccct act acctttccca 

[C,G] 

CCTCTGGTMCCATC^TTCTTCTCACTA^ I I I I I I 1 1 IAAACTCCC 

CTATATGAGTGAGAACATGCAGTATTTGTL I I 1 1 I GTGCCTGGCTTATTTCAGTAATGT 
MTGTTCTCTMTTTCATCCACATTATTGCAMTGACATGATT^ 1 1 CI IATGGCT 
GTCTATATGTACCACATTTTATTTATCCACrCATCT 
CATATCTTGGTCATTGTGMTAGTGCTGTACT 

14677 AGAGATAGAGATCTMTTTCATTCTTCTGCATATGGATATCTAGTT^ 

TC1TGTGGAMTTGTCCTTTGCCCAATGTATC I I CI IGATGCCI I IGI IGAAAATTAGTT 
GACTATAAATGTGTGGATTTATTTGTGGG 1 1 CI I I ATTCTGTTCCATTGGTCTATGTGTC 
TGI I I I I ATGCCAGTATCATGCAGTTTTGATTATTACAG AGT ftTAATTTGAAGT 
CAGGTCATGTGATGCCTCCAGC 1 1 IGI I CI 1 1 1 1 1 CTCAGMTCTTATATTTAGAAAAAC 
[C,G] 
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TAMGACTCCMCAAAAMCCTGCTAGMCTGA^ 
ACMC^TCMCATACAA^ 

AAAMGAMGAAAAAAAMCMGAMTMTCCCATTT^^ 

ACACCTAGGAATAMCCATACCAMGMGTGAAAGA^ 

ACTGATGAMGAMTTGAAMTGACATT^ 

14734 Al 1 1 CI I GTGGAMTTGTCCTTTGCCCAATGTATG I ICI IGATGCCI I IGI IGAAAATTA 
GTTGACTATAMTGTGTGGATTTATTTGTGQb I I <_ I I I ATTCTGTTCCATTGGTCrATGT 
GTCTGI I 1 1 1 ATGCCAGTATCATG^GrrTTGATTATTACAGGTTTGTAGTATMTTT 
AGTCAGGTCATGTGATGCCTCCAGC 1 1 IGI I CI 1 1 1 1 1 CTCAGAATCTTATA"TTTA6AAA 
MCGTAMGACTCCMCAAAAMCCTGCT AGMCTGATAMCAMTTCATTAMTTTGCA 
[6 1 A] 

GATACMCATCMCATACAAMTTCAGCAGCA 

AAAAAAMGAMGAAAAAAAMCAAGAAATMTC^ 

AAMCACCTAGGMTAMCCATACCA^ 

MCACTGATGAMGAMTTGAAMTGACATTAAAAMTGGAAAGGT ATTCCATGTTCATG 
GATTGCMGMTCMTATTGTTAAMTGTC^ 

14747 ATTGTCCnTGCCCAATGTATC I TCP GATGCC 1 1 IGI I GAAMTTAGTTGACTATAAAT 
GTGTGGATTTATTTGTGGG I ILI I I ATTCrGTTCCATTGGrCTATGTGTCTC, I I I I IATG 
CCAGTATCATGCAGTTTTGATTAT^ 

GATGCCTCCAGCI I IGI ICI 1 1 1 I I CTCAGAATCTT ATATTTAGAAAAACGTAAAGACTC 

CMCAAAAMCCTGCTAGMCT6ATAMCAMTTCATTAM 

CA,G] 

CATACAAAATTCAGCAGCATTTCMTATGCGWSAG 
AAAAAAAMCMGAMTMTCCCATTTATMTAGCTAQ^ 

ATAMCCATACCAAAGMGTGAMGATTTCT ACMTGAAMCTATAAAAG^CTGATGAAA 

GAMTTGAAMTGACATTAAAAMTGGAMGGTATC 

MTATTGTTAAMTGTCCATATGATCC^AMCMTCTAC^ 

14808 TGTGGATTTATTTGTGGG I ICI I I ATTCTGTTCCATTGGTCTATGTGTCTG I I I I IATGC 
CACTATCATGCAGTTTTGATTAT^ 

ATGCCTCCAGCI I IGI ICI I I I I I CTCAGMTCTTATATTTAGAAAMCGTAAAGACTCC 

MCAAAAMCCTGCTAGAACTGATAMCAMTTCATTAMT^ 

CATACAAMTTCAGCAGCATTTt^W 

C-.A] 

AAAAAAMCMGAMTMTCCCATTTATMTAGCTACAMTAAM 
TAMCCATACCAMGMGTGAMGATTTCT ACAATGAAMCTATAAAACACTGATGAAAG 
AMTTGAAMTGACATTAAAAMTGGAMGGTATTCCATGTT 
ATATTGTTAAMTGTCCATATGATCCAAMCMTCTACAGAT^ 
AMTACCMTGACATTCTTCATTGAMTAAA^ 

15086 MTMTCTTAAAAAAMGAMGAAAAAA^ 

AAATAAMTAAMCACCTAGGMTAMCCATACCAMGAAGTGAM 
AMCTATAAAACACTGATGAAAGAMTTGAAMTGACATTAAAA^ 
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ATGTTTCATO^TTGCM 

CAGATTCMTGCMTCCCTAT^ 

[-,A,G] 

CCTAAMTTTMGTGGMCCATGMGGTAGATCTCTGCrATAC^^ ACT 
CAACAAACCTTGMTATGAAGACTGGQGAAGTGAATAGGCAGCrrC^ 
TGGTGAMTTTAGGAGAATGGATGTTTTATAATGGGT AGCAL 1 1 1 (_ 1 1 ACATGTTCTCAA 
TCAGCCATMCTTACTACACT^ 

ATAAMTCCTAAAAMGGAG^GMGCACATATAMCCTGCGTC™ 

15414 TAGATGTCTKTATACATAGM 

GMGTGMTAGGCAGCTTCACTCTrCTATTCCCrGGTGAMTT^ 
TATMTGGGTAGCAL I 1 1 LI I ACATGTTCTCMTCAGCCATMCTrACTACAGTCMTTT 
GMTmTTGCATTTGAATATATTGGATTAAAMTAA^ 

CATATAMCCTGCGrCTTATTTCATGTGTTCC I I I LI I ICTGGGTGALI I I ILI I I IGAA 
[A,G] 

TAAMCCTGCAAMTMCAGGACAGG^ 

AGCAGCAGTCCTG I I I lAT^CCrcrrC^TTTrCTGTTATTGAGMTTC^AGMGMGGA 
GGAGGAAGAC^CACATC^ ACTGT ATTCCA 
MTAGCAjQCCMTO\GG^^ 

TATTCTCT AAGAAGCT AAATTGTGTT AGACTGAMCCCATMGGMCCATrGTTCAAAGT 

15722 TGCAAMTMCAGGACAGGGTGGMGGGAGATGGGATCCCCTC^ 
GTCCTGTTTTATCACCTCTTCATTTTCrGTTATT 
GAGTTCACATCCACAGACTGGTGTGGTTGMTAGTTGTCTCTACT 
GCCMTGAGGCTGTTACAGTGMGCCAGTCCCAAGATMTTGTTCT 
TMGMGCTAMTTGTGTTAGACTGAMCCCATMGGMCCATTG^ 
[T,C] 

TCAAAAGTAAAGA I I 1 1 I MTAGTTTCTCrTAATTAGATTATTTTCTMG^ 
ATGATTACTATTTTATCTCTATMTTTT^ 
CCTTTGGAAAAMTTGGCTTTTAG^^ 
AGCCTAGGAAATTGGTACTATGACTTTOGTATGT^ 
ACTCAMGATGTT AMTATGCTGGCCMCTTCACAM 

15861 GGTGTGGTTGMTAGTTGTCTCTACTGTATTCCAMTAGCAG^ 

TGMGCCAGTCCCMGATMTTGTTCTGTACCCCTATTCTCTMGA^ 

AGACTGAMCCCATMGGAACCATTGTTCAMGTTGGL I I LI I CAAAAGTAAAGATTTTT 

MTAGTTTCrCrrMTTAGATTATTTTCTMGACATAGMTTATG^ 

CTATMTTTTCATCTCTATMCGTTTACA^ 

[T,C] 

TTTAGCTTTACrTTTGCMTATTTTATT^ 

ATGACTTTTAGTATGTTCATTTMTAGATGAAMC^ AAATAT 

GGTGGCCMGTTCACAMGCTGATCATTMCM 

GATTTMTCTGTGACAGTGCACLTGGGTGCGCATG^ 

TAGAACCTTTCCT AGTTGGCTTTGCTCCATGATGACCATTACTGTTCLTTCr^ 
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m 



16264 CTCAAAGATGTT/WVTATGCTX3GG 

CTGAACTCCTGGTTTTCTXaATTTMTC 
CACCCCCACACTTGCACATAGMOT 
TGTTCCTTCTACTTCAAMTMGCA^ 
CTGTCAAGCAKCCATTCCATTAGTCAGCrTGTGGlTCACTL^ 

[a,t] 

MTGGTATATTTATCTAGATMTTCTAOT 

TMTTCTGTGCATCATTTTTCTCrGATTCTGAM 

TAATATGAGTTTTATCTCCTTTAGAGTCGMTGGATGTG^ 

GGTTCATCAGTACACAACATTCT^^ 
ATTCCAATCCTTATTTTCMTATATTTAAAAAGACA^ 

16314 ACMCAGGGCCrGMCrCCTGGTTTTCTTGATTTMTCT 
ATGO\TQCATCACCCCCAOVGTX3CACATAGA 

TGACCATTACrGrrCOTCrACrrOW^TMGCAMTTATCCT ACAGATTCAGAGCTGG 

TACAGGrereaxnwGON^ 

GTATTGACCTAMTGGTATATTTATCTAGATMTTCrACL I ibi 1 ATTTTCAAAGCCCCA 
[G,A] 

TCTTGTTTGCTMTTCTGTGCATCAT^ 
(KAATTGCrcTMTATGAGrriTA^^ 

TGCrCCCACTQGTTCATCAGT AG\C^CATTCTGCATATAAMC^GGrAGAGTCTT AGTC 

ATGGAAMCCATTCG\ATCCTTATTTTCAATATATTTAAAM 

AAOVGQCCTACCCTAAGMTCTrAAGAQCTTGCTTCC^ 

16877 TAAGAGCTTQCTTCCAGrrTGTCCrrGCTGCC^ 

TMGAGAMG^TGrrATGGTA^GACCMGTAGATGACATAMTGMG^CG^CCrTAAA 

TCAGAGTTTTAAAMTAGGCCCrGMCTGMGCMGAGOTAMCTAGGGM 

GMCTGAGACTTCTCCAGAGAGAAGTATCTGGGATTTAAL I 1 1 CTAATGAGGCTTGG 

TTTTCCATCAACTTTTCCTTTAAA^ 

[A,G] 

TTTGTCATMTTGTAAMTGGGTGGTTACATCCTTCT 

GTCCTAGC^TACAGCATTTTTCTAAMTTTGCT 

TATTCTTTTTCTAAAAMC^TTTGTTTCAGCITTACCACT 

GACTGQGGAMTGACGCTGATMTATGAAACATTACMTCAG 

CCCG^GCATGCTGATTTTGATAMTTATMTAAAAMTTATTTGAGGGT 

16966 AGTAGATGACATAMTGAACACCACCTTAMTC^^ 

MGCAAGAGGTAMCTAGGGAAGCCTCAGGAGMCTGAGACT 
TGGGATTTMCTTCTTTCTMTGAGGCTTGGT^ 
GGGGGTATTGCrCATCTTTCTGTTCAGCC^ 
CATCCrTCTGGTGATCTAQGAGCCCTATTTTCGTCCTA^ 

[T,G] 

TGCTGTTAGCTTTCATG^TTCTTACCCTMCTA 

GCTTTACCACTCTGAT^ 

ACATTACMTCAGGTGAGCTATTTACAGTMCCCCAGCATGCTGATTTT 
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ATAAAAMTTATITC^G^ 

TAGAA U I 1 1 1 I I rA AAAAAATTTTMTTTTMTTTTMTTTATTTCAGA 

17147 GGGGTATTGCTCATCTTTCTGTTGAGCCCCA^ 

ATCGTCTQGTGATCTAGGAGCCCrATTrrCGTCCTAGCAT^ 
TGCTGTTAGCnTCATGATTCTTACCCTMCTATTC. I 1 1 I ICTAAAAAACAl I lb I I iCA 
GCTITACCACrcrGATGMTT<^GAG<XrATGACrGGGGAMTGA^rGATMTAT^ 

ACATTACAATCAGGTGAGCTATTTACAGT^ 

[A,G] 

TAAAAMTrATTTGAGGGTGGAMGACTCCrACCTGTCATTTGGTGGCATT^ 
AGAA CI I I I I I I I AAAAAMTTTTMTTTTMTTTTMTTTATTTCAGAAMTT^ 
TTAMGMGCATATACAMGAMCTTACATCATGTGTMTCCTTC 
AGATGTACTMCATTTTGGTGTATTTATTCCMTTTTCT 
CAAOTTTMTCrrTCTATTTTACrrAAGCTATACT 

ATCTAGGAGCCCTATTTTCGTCCTAG 

TCArGATTCTTACCCTMCTATTCTTTTTCTAAAAAACA I I IGI I I G^GCTTTACCACTC 
TGATGAATTCAGAGCmTGACTGQGGAAATGACGCTGATMTATGA^ 
GGTGAGCTATTTACAGTMCCCCAGCAT^ 

TTGAGGGTGGAMGACTCCTACCTGTCATTTGGTGGCATTTO^ I 

[T,C] 

TAAAAAAATTTTMTTrTAATTITAATTrATT^ 
ATACAAAGAAACTTACATCATGTGTMTCCrrcCATCCAG 

ATTTTGGTGTATrrATTCCMTITTCTCAGTATr ATATTGCT1TT AGACAACTTTTAATC 
TTTCTATTTTAaTMGCTATAGTMGAGATMCTMTATMCTG^GGGA I 1 1 I lAAATG 
CATTTITMTGGCTACATMTAGAMTTATT^ 

AAMTCAAACAAMTCAACACGCACATTCAAGATCATTATG 
GAGAGTGTTMTGTCCTTAGMTTTGGCCACAGTTAGCTGGTCCTACTCT 
QGTCCTATTTTGTGMTTMTCTCATTTCA^ 
ACTAGTCXCAACAGTTTGCTCTCT 
TTTTATTGAGTATMGAGMTTMCCCATGTM 
[A,G] 

TTTTGTTCACCAGTGTTTTCTCATCTTG^ 

TCTATGTGTTG(^TTMTGAAATTTCTTMC1TTAATCT ACCTCAAAATGTCTCTATCTT 
CTTGATTCTCTCCTTCCTTTCTCTATCAGAAMTG^ 
TCCGGTCCTGTGCCCTTGATCCCATCT^ 
TCCTGTCCCTTATGAAAAACAAGCAAGACCATCAATTC^ 

18655 TCAAGATG^TTATGGTCAAGTACTAAAGTATGTGAGA 
CGVCAGTTAGCTGGTCCTACTCTGCTCCAAGCCGOT 
TGATGCCMTTTTTATTACATTCTCTCCAAAAMCTAGTCT 
CAAGTTCACAGCATTATCTCTGCTATATCrATAT^ 
ATGTMGCTCCATGAGGGTAGGGATTTCTC^^ 
[T,G] 
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GMGVCTACATGACMTT/VCTC 

TMcrrrMTcrACcrow^TGTCTCTATcrrcrrGATTCT 

AGAAMTGATQGTCCTCTTATTTTCCAAGTTATTCCGGTC 
OTCT(^C1TCCCCTTCa^^ 

ACGVrCMTTCTATGWrrrATCATTATGTCACTCTGTTCTTATC^ 1 1 1 1 ACTA 
CACTATCTATCTGTTGCATTMTGAM 

ATCTTCTTGATTCTCTCCTTCCTTTCTCTATCAGAAMTGATGCTCCT 
GTTATTCCGCTCCTCTGCCCTTGATCCCATCTCTTCTCACrrc 
CATTCTCCTCTCCGn^TGAAA^ 

CTCACTCTCTTCrrAT<^CATATTTTTACTATT(^(^GGGC I I C ITCTACTTACTCCT 

[G,T] 

AACCrTCTACAATCTAGTTTAGCTCTTCATCTTTT^ 

ACCCATGGCTTTTMTTGCQWCTT^ 
TTCCTTACCACACTCTCCTT^^ 
TTTCTGATTTTCClTCTGTTTCTGATTGrrrCCTTTTCT 

TTCCACCTCTCTGAAATCATTAGCATTC 

CGTT ACCACAGTXTCCTTGAAACrCAGTCCCCTGACrra 
TCTGATTTTCCTTCTGrrTTCTGATTGTTCCTT^ 
CCAGCTOOGAMTCATTAGCATTCCCCAAG^ 

GAGMCTCAC^TAGCrrrAATTTGGACG^TTTCTATGGCTT ATCTAGAl 1 1 1 1 iCAGGA 
C1TGCCITCMCCTATTCTTTCTCTAGCTGATTCCATTMCT 

GAAGACAGACCTCCGAGAAATGACCCTTCTCTCCAAMCTTCCGCMTATCT 
CCTAGCCTGAG^TTCAGACTTTGATTATCTGCCTCCMGlTrATATCCT 
TTATATATTCTGTTCTCCAGCTACACT^ 
ACTCTTCCTGCCTCCCACTCACCCTCATCT 

CTCATTTCACAC^CCCCTCTTTCTATGAAGCCCTCAGCTGGAAATAAI I 1 1 1 iGCCTTT 

19531 CTCTCTGAMTCATTAGCATTCCC^GGATTCTTCAAAACTCT 

ACTCAGCATAGCTITMTTTGGACCATTTCrATGGCTTATCTAGA I 1 1 1 1 iCAGGACTTG 

CCTTCMCCTATTCTTTCTCTAGCTG 

GACAGACCTCCGAGAWGACCCTTCTCTCCAAMCTrCCGCMTATCT 

A(KXTGACATTCAGACTTTGATTA^ 

[T,C] 

ATATTCTCTTCTCCAGCTACACTGC^^ 
TTCCTGCCTCCCACTCACCCTCATCT^ 

TTTCAG\GGACCCCTCTTTCTATGMGCCCTCAGCTGGAAATAA I 1 1 1 l iGCLl l l I I I i 

CCATTTTATTTTTGGACTGTITTATGGCATTTAAC^^ 

GCCTTGCTCCCTCTTTTGCAMTTT^ 

19911 CTCATCTCTGCTCTCAAMTGCMCC^^ 
CTATGAAGCCCTCAGCTGGAAA^ 
TTTATGGG^TTTMCATACGTAGTTCTATACAMTATTT 
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AAATTTCTTAAAQGT AGAGACCATTGTATG 1 1 1 1 LI I CATATGTTGCTGGTGCCTAACAG 

MCTATGGCCATTGTCQVCATTCA^ 

[C,T] 

CTCTCATGMTGCCCTTGCTTTCTCT^^ 

GCCATGAAAGTGCCTACTGCTATTT^^ 

GATGTGGCCAGGATACTCCCTC^ 

TGGMCCACTTTGATTTTGTCTGGGGC^ 

ATAGCmMTGAAGGCATATTCCTAMTGCMTGCATTTACTm 

20199 TTTGAGGAGCITCCrcrCATGMTGCCCrrGCTTTCT 
ATATGACCTGACTGCCATGAMGTGCCT^ 
CGTMCACCCCAGGATGTGGCCAGGATAC^^ 
QCTA7TQCO\GATTQGAACCAlGTTGATTTTGTCT 
GTACAGTXyWVTCATAGCTTTMTGAAGG 
[A,G] 

ATTAAAAGTTXKTTCCAAGCCCATAAGGGACnTAGAAAAM 

TTGTCCCCCAGGACCCTGGGGGAG^ 

TAGTGTTATTTATGTTTAGAGACATCT^ 

ATGAGGTAGATTAGGCAAAAAGATAAACAAGTTGCTACrCT ATCTGGCATTTAAGTCTAA 
TTAMTTGTMTTTTTAGGGCATACCATGMGTATAGAMTGTCr 

20243 AGACTCATCCCCCTATATATGA^ 

GTGGACATGATGTCCTCGTMCACCCCAGGATGTGGCCAGGATACrCCCTC^ 
GTCTTC^TTACTTTMGCTATTGCCAGATTGGMCCA 
ATGCCCCTCAACGGATGTACAGTGAAAT^ 
MTGCATTTACITTTCMTTAAMGITGCITCC^ 

L G ' A ^ . ... 

GTMCGAACMTGAGGnTGTCCCC(^GCACCCrGGGGGAGATGCAG\GTGGAGTCTG^ 

TCCMGTCMTTGTGTTAGTGTTATTTATGTTTAG^ 

CAGGTCCTTATAMCMTGAGGTAGATTAGGCAAAMGATAMG 

TGGGVnTAAGTCTMTTAMTTGTMTTTTTAGQGCA 

TGMGCrrCAMGGMCAGTGAMTTCCTTTMGGTCCTATATGGAMCCT 

20640 GACATCTTTGCATGGGACCATCTA^ 

AGATAMO^GTTGCTACTCTATCrGGCATTTAAGTCr AATTAAATTGTAA I l l l lAGGG 
CATACCATGAAGTATAGAAATGTCTGAAGCrTCAMQGAACAG^ 
CCTATATGGAMCCrCTGTTGTCATTTTATTTATATGGATTGCT 
TGTGGGATTAGGAGGAGGGCCTGTMCTTCTTTATAAAAG I MCI IAGCTATCCTGAAGA 
[T,C] 

GTATAGACATTTTTACTTTTTTAGGTATTTTC^ 

AGATTCTTCCAGAGAAGCCCTCI I I ILI I ACAATCTTATCCCTGGCT ATCTGCGTAAACG 
GAATCTTGMCCCATMTAGGATACATGT^^ 

TTGTAGVGCATCMTATCATTTTATMTCATAGGGAGGC I ILI I IU I I AGCATGTAATG 
CCCCCTTTACAGG LI 1 1 1 ILI ILI I 1 G AGGGGTTTGMCATTCCATGAAAMCTGACAGA 
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21156 AGGU IU I IGI I I AGCATGT AATGCCCCCTTTACAGGL 1 I 1 1 IGI ILI I IGAGGGGTTT 
GMO\TTCCATGAAAMCTGACAGATAGGAM 

GMGCAGAMGrACTAGGCTAGATAGrcrCTAAACATrAAGTA I 1 1 ILI ICCTCCATCTT 
AAMGGVVTGAGMGCC^CCAAMTATrrrACCTMTGGAM 1 1 I I 

GTMCCACCACTITGGCTGCTACATAGA^ 
[G,C] 

AGG\AGTCTGTAMTCTGATCAAGTGTTCTGATGCAQGCT 
GAGATGATCCTTGGAAMTCCAGAGCCAGC^^ 
TCG^CMGCTGCTGGCCCCTGGACXCATTCTrcrCrCAAMCTA 
TGTATACGTATTGATQGGGMTMTGGTCACTATGAAMCC^TGTGATMTATGGAAAM 
TACCCATGATATMTGTTATGTGMGAGMGAAMTGAAACTGGT AGAACT ATGTGATTG 

21163 1 1 IGI I I AGCATGTAATGCCCCCTTTACAGGL I ITTTGl ILI I I GAGGGGTTTGAACATT 
CCATGAAAMCTGACAGATAGGAMCTGACMTAAMGATTGAGCT 
AMGTACTAGGCTAGATAGTCTCTAAACATTAAGTA 1 1 I ILI I CCTCCATCTTAAAAGCA 
ATGAGMGICACCAAAATATTTTACCTMTG^ 1 1 I I IGTAACCA 

CCALTTTGQCTGCT AO^TAGAGMTGGATTAGMGATGCCAACAAMGATTCTGAGCM 
[A,T] 

(TGTAMTCTGATCMGTGTTCTGATO^ 

TCCTTGGAAAATCCAGAGCCAGCTCCATMTACTTTCLTGCT 

GCTGCTGGCCCCTGGAGCCATTClTCrCTCAAAACT 

GTATTGATQGGGMTMTGGTCACTATGAAMCCATGTGATMTATGGAA 

GATATMTGTTATCTGAA(^G 

2142 5 MTQGATTAGMGATGCCMCAAMGATTCTGAGC^GTLT^ 
LTGATGCAQGCTGATATCCTTCTGTGCrMGAGAGATGATC 
GCTCCATMTACTTTCCTGCTCTGCTGGCAM 

TCITCTCTCAAAACT AGCATTCATCA^TTTMTGTATACGTATTGATGGGGMTM 

OVCTATGAAMCCATGTGATMTATGGAAAMTACCCATGATATMTGTTATCT 

[G,A] 

AGAAMTGAMCTGGTAGMCTATCTGATTGC^ 
ATGACTTTATAAMTATTTGTATATMTGAAMCTGM 

TTGTGTCAQGGTAGTAACATGATGAGTGATTAATAG 1 1 1 1 1 MTTTTTAATATAGTAATG 
ACATMTGTTACMCTTGTCCAMTCTCACAMCATMTA^ 

ATAAAAGAATACATATTTTATTATACA I I I I I ATGTAGGCTAATTGATGGTTCTGAAAGC 

chromosome map: 
Chromosome 10 
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